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Adenovirus Vectors, Packaging Cell Lines. Compositions, and Methods for 

Preparation and Use 



This application is a continuation-in-part of U.S. Application 09/423,783 filed 
November 12, 1999 and claims the benefit of the filing date of U.S. Provisional 
Application 60/1 15,920 filed January 14, 1999. 

This invention was made with U.S. government support under NIH Grant No. 
HL 54352. The government has certain rights in the invention. 

The present invention relates to gene therapy, especially to adenovirus-based 
gene therapy. In particular, novel packaging cell lines are disclosed, for use In 
facilitating the development of high-capacity and targeted vectors. High-capacity 
adenovirus vectors are also disclosed herein, as are related compositions, kits, and 
methods of preparation and use of the disclosed vectors, cell lines and kits. 

Enhanced transfer of DNA conjugates into cells has been achieved with 
adenovirus, a human DNA virus which readily infects epithelial cells (Horwitz, 
"Adenoviridae and Their Replication", in Virology, Fields and Knipe, eds., Raven 
Press. NY (1990) pp. 1679-1740). 

There is a need in the art to obtain Adenovirus vectors capable of 
incorporating large segments of foreign DNA and capable of being targeted to 
specific cells, as well as to obtain cell lines which can package such adenovirus- 
gene deficient vectors or targeted vectors. These needs, as well as others, are 
met by the invention. 

This invention utilizes recombinant adenovirus constructs which duplicate the 
cell receptor binding and DNA delivery properties of intact adenovirus virions and 
thus represents an improved method for gene therapy and cell targeting as well as 
for antisense-based antiviral therapy. 

In contrast to the disadvantages of using intact adenovirus, modified 
adenovirus vectors requiring a helper plasmid or virus, or so-called replication- 
deficient adenovirus in the art, the use of recombinant adenovirus-derived vectors 
according to one aspect of the present invention provides certain advantages for 
gene delivery. First, the Ad-derived vectors of the present invention possess all of 
the functional properties required for gene therapy including binding to epithelial 
cell receptors and penetration of endocytic vesicles. Therapeutic viral vectors of 
the present invention may also be engineered to target the receptors of and 
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achieve penetration of non-epithelial cells; means of engineering viral vectors to 
accomplish these ends are described in detail herein below. 

Second, the vectors of the present invention have deletions of substantial 
portions of the Ad genome, which not only limits the ability of the Ad-derived 
vectors to "spread" to other host cells or tissues, but allows significant amounts of 
"foreign" (or non-native) nucleic acids to be incorporated into the viral genome 
without interfering with the reproduction and packaging of the viral genome. 
Therefore, the vectors of the present invention are ideal for use in a wide variety of 
therapeutic applications. 

Third, while the vectors disclosed herein are safe for use as therapeutic 
agents in the treatment of a variety of human afflictions, some of these vectors do 
not require the presence of any "helpers" for propagation and packaging, largely 
because of the novel cell lines in which they are reproduced. Such cell lines - 
referred to herein as packaging cell lines - comprise yet another aspect of the 
invention. 

, To reduce the frequency of contamination with wild-type adenovirus, it is 
desirable to improve either the viral vector or the cell line to reduce the probability 
of recombination. For example, an adenovirus from a group with less homology to 
the group C viruses may be used to engineer recombinant viruses with little 
propensity for recombination with the Ad5 sequence contained in the packaging 
lines. The Invention describes the preparation of packaging cells lines which 
stably expresses adenovirus proteins or polypeptides. These cell lines are useful 
for complementing viral vectors bearing deletions of regulatory and/or structural 
genes, irrespective of the serotype from which such a vector was derived. 

It is also contemplated that the constructs and methods of the present 
invention will support the design and engineering of chimeric viral vectors which 
express amino acid residue sequences derived from two or more Ad serotypes. 
Thus, unlike methods and constructs available prior to the advent of the present 
disclosure, this invention allows the greatest possible flexibility in the design and 
preparation of useful viral vectors and cell lines which support their construction 
and propagation - all with a decreased risk of recombining with wild-type Ad to 
produce potentially-harmful recombinants. 

In part, the present invention discloses a simpler, alternative means of 
reducing the recombination between viral and cellular sequences than those 
discussed in the art. One such means is to increase the size of the deletion in the 
recombinant virus and thereby reduce the extent of shared sequences between 
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that virus and any Ad genes present in a packaging cell line e.g., the Ad5 genes in 
293 cells, or the various Ad genes in the novel cell lines of the present invention. 

Deletions of all or portions of structural genes of the adenovirus have been 
considered undesirable because of the anticipated deleterious effects such 
deletions would have on viral reproduction and packaging. Indeed, the use of 
"helper" viruses or plasmids has often been recommended when using Ad-derived 
vectors containing large deletions in structural protein sequences precisely for this 
reason. 

Contrary to what has been suggested in the art, however, this invention 
discloses the preparation, propagation and use of recombinant Ad-derived vectors 
having deletions of all or part of various gene sequences encoding Ad structural 
proteins, both as a way of reducing the risk of wild-type adenovirus contamination 
in virus preparations, as a way of allowing foreign DNA to be packaged In such 
vectors for a variety of diagnostic and therapeutic applications and as a way of 
targeting an adenovirus vector to a specific cell type. 

The invention further discloses a wide variety of nucleic acid sequences and 
viral vectors. Thus, in one embodiment, the invention discloses a nucleic acid 
sequence encoding any one of the adenovirus fiber proteins mentioned in the 
specification, polypeptides or fragments thereof - including, without limitation, 
those that include deletions or other mutations; those that are chimeric; and those 
that have linkers, foreign amino acid residues, or other molecules attached for 
various purposes as disclosed herein. Nucleic acid sequences encoding various 
other adenovirus structural and/or regulatory proteins or polypeptides are also 
within the scope of the present invention. 

In various embodiments, the adenovirus is a Group C adenovirus selected 
from serotypes 1, 2, 5 or 6; while in other embodiments, adenovirus selected from 
other serotypes, such as for example Ad37 (subgroup D) are useful as disclosed 
herein. 

The invention is also directed to an isolated nucleic acid molecule comprising 
an adenovirus tripartite leader (TPL) nucleotide sequence, said TPL nucleotide 
sequence comprising (a) first and second different TPL exons or (b) first, second 
and third same or different TPL exons, said TPL exons selected from the group 
consisting of complete TPL exon 1 , partial TPL exon 1 , complete TPL exon 2 and 
complete TPL exon 3, wherein said isolated nucleic acid molecule may not 
comprise the nucleic acid sequence consisting of partial TPL exon 1 , complete TPL 
exon 2 and complete TPL exon 3. A preferable embodiment of the invention may 
further comprise an intron operatively linked to the TPL, wherein said intron also 
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contains requisite processing signals for the intron's removal. Another preferable 
embodiment of the invention is directed to the isolated nucleic acid molecule 
wherein said TPL nucleotide sequence consists essentially of complete TPL exon 1 
operatively linked to complete TPL exon 2 operatively linked to complete TPL exon 
3. A related embodiment may further include an intron and appropriate processing 
signals. Additional embodiments of the invention are directed to nucleic acid 
molecules contained in plasmids selected from the group; consisting of pCLF, 
pDV60, pDV67, pDV69, pDV80 and PDV90. Packaging cell lines and adenovirus 
particles containing the nucleic acids described above are also included in the 
invention. 

The invention is further directed to methods for producing an adenovirus 
vector particle containing a helper-independent fiberless recombinant adenovirus 
vector genome comprising providing a) a packaging cell line which complements 
replication and packaging of said genome and b) a helper-independent fiberless 
recombinant adenovirus vector genome which is deficient in expressing sufficient 
functional fiber protein to support assembly of fiber-containing particles. The 
genome is introduced into the cell line. Additional embodiments of the invention 
may also include the following steps; a) growing the ceil line produced under 
conditions for producing particles; and/or b) harvesting an adenovirus vector 
particle containing said helper-independent fiberless recombinant adenovirus 
vector genome. The method may also include a cell line that expresses a fiber 
protein and complements a fiber mutation in the vector. 

The invention is also directed to an adenovirus vector packaging cell line 
comprising a stably integrated nucleic acid molecule as described above, an 
operatively-linked promoter and a nucleic acid sequence which encodes an 
adenovirus structural protein, wherein said TPL sequence consists essentially of a 
first TPL exon operatively linked to a complete second TPL exon operatively linked 
to a complete third TPL exon, wherein said packaging cell comprises an isolated 
nucleic acid molecule comprising an adenovirus tripartite leader (TPL) nucleotide, 
said TPL nucleotide sequence comprising (a) first and second different TPL exons 
or (b) first, second and third different TPL exons, said TPL exons selected from the 
group consisting of complete TPL exon 1 , partial TPL exon 1 , complete TPL exon 2 
and complete TPL exon 3 and, wherein said isolated nucleic acid molecule may not 
comprise the nucleic acid sequence consisting of partial TPL exon 1 , complete TPL 
exon 2 and complete TPL exon 3. Preferably, the cell line may have a complete 
first TPL exon. Another embodiment of the invention comprises adenovirus 
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structural protein, such as adenovirus fiber protein or a chimeric protein which 
includes an adenovirus fiber protein tali domain. 

The invention is further directed to a recombinant adenovirus particle 
comprising a recombinant adenovirus vector genome wherein said genome:(a) 
does not encode or does not express sufficient adenovirus fiber protein to support 
pacl<aging of a fiber-containing adenovirus particle without complementation of 
said fiber gene, and (b) encodes an adenovirus packaging signal and inverted 
terminal repeats containing adenovirus origin of replication. The invention is also 
directed to a helper-independent fiberless recombinant adenovirus vector genome 
comprising genes which (a) encode all adenovirus structural gene products but do 
not express sufficient adenovirus fiber protein to package a fiber-containing 
adenovirus particle without complementation of said fiber gene or said genome 
lacks at least the fibre gene and (b) encodes an exogenous protein. Either of the 
above embodiments may substitute a helper-dependent for a helper-independent 
recombinant adenovirus vector genome. In a preferable embodiment, no fiber 
protein is expressed. In yet another embodiment of the invention, the recombinant 
adenovirus particle fails to express sufficient fiber protein to allow fiber 
incorporation into the particle such that the particle can use the fiber pathway for 
infection. 

The invention is further directed to a method for producing an adenovirus 
vector particle containing a helper-independent fiberless recombinant adenovirus 
vector genome, said method comprising providing a packaging cell line which 
complements replication and packaging of said genome and a helper-independent 
fiberless recombinant adenovirus vector genome which is deficient in expressing 
sufficient functional fiber protein to support assembly of fiber-containing particles, 
wherein said packaging cell comprises an isolated nucleic acid molecule 
comprising an adenovirus tripartite leader (TPL) nucleotide, said TPL nucleotide 
sequence comprising (a) first and second different TPL exons or (b) first, second 
and third different TPL exons, said TPL exons selected from the group consisting 
of complete TPL exon 1 , partial TPL exon 1, complete TPL exon 2 and complete 
TPL exon 3 and, wherein said isolated nucleic acid molecule may not comprise the 
nucleic acid sequence consisting of partial TPL exon 1 , complete TPL exon 2 and 
complete TPL exon and harvesting said adenovirus particles produced by said cell 
line.. In a preferable embodiment the adenovirus particle further comprises an 
exogenous protein or a modified fiber protein. The method may also comprise a 
step of coating a particle (i.e. providing fiber protein in any way) with an adenovirus 
fiber protein. 
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Another aspect of the invention is directed to a method for pseudotyping 
recombinant viral vectors comprising complementing a missing fiber gene of a 
helper-independent ftberless recombinant adenovirus vector genome by 
expressing in packaging cells a fiber gene from a different adenoviral serotype 
than said recombinant adenovirus vector, thereby pseudotyping said vector. An 
additional embodiment of the invention is directed to the method for pseudotyping 
recombinant viral vectors comprising: a) providing a packaging cell line for 
propagating a fiber gene deleted recombinant adenovirus vector, b) introducing 
into said cell line a helper-independent fiberless recombinant adenovirus vector 
genome, and c) complementing the missing fiber gene by expression in the cells of 
a fiber gene from a different adenoviral serotype thereby pseudotyping the vector. 

The invention is further directed to a method for specifically targeting an 
adenovirus vector to a cell of choice comprising introducing a helper-independent 
or helper-dependent fiberless recombinant adenovirus vector genome into a 
packaging cell line for producing a fiber gene-deleted adenovirus vector and 
providing, wherein said gene for a missing fiber protein is complemented with a 
gene for a desired modification for targeting the vector to a cell of choice 

The invention is further directed to a method for producing a modified 
adenovinjs comprising providing in vitro an exogenous fiber protein to a fiberless 
adenovirus. Additional embodiments of the invention may provide any combination 
of all of the following steps such that the invention be directed to a method for 
producing a modified adenovirus comprising: a) providing a packaging cell line for 
producing a fiberless adenovirus vector, b) introducing into said cell line a helper- 
independent fiberless or helper-dependent fiberless recombinant adenovirus vector 
genome, c) growing and harvesting a fiberless adenovirus, d) maintaining the 
fiberless adenovirus in any suitable buffer, and e) providing exogenous fiber, 
wherein said fiber may be a modified fiber, to the fiberless adenovirus by adding 
conditioned media or a soluble fiber preparation or a fiber in any suitable buffer to 
a virus preparation thereby producing the modified adenovirus. 

The invention is further directed to a method for producing a modified 
adenovirus comprising providing a packaging cell line for producing a helper- 
dependent fiberless adenovirus vector genome and providing a helper virus vector, 
wherein said cell line complements at least a deficient fiber protein gene, thereby 
producing the modified adenovirus. Another aspect of the invention is directed to 
a method for producing a modified adenovirus comprising: a) providing a 
packaging cell line for producing a fiberless adenovirus vector, b) introducing into 
said cell line a helper dependent fiberless recombinant adenovirus vector genome 
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and a fiberless helper virus vector, c) growing and harvesting a fiberless 
adenovirus, and d)maintaining the fiberless adenovirus in infectious media, and e) 
providing exogenous fiber to the fiberless adenovirus by adding conditioned media 
or a soluble fiber preparation to a virus preparation thereby producing the modified 
adenovirus. 

Additional aspects of the invention are directed to hybrid Ad/AAV vectors and 
to new helper-dependent vectors used with fiberless adenovirus vectors. 

The invention is also directed to a method for delivering a heterologous gene 
to an EBV-infected B cells comprising infecting said B cells with a pseudotyped 
Adspgal.A F particle or other fiber-deleted adenovirus particle, said particle having 
a chimeric fiber with the receptor-binding knob domain of the adenovirus type 3 
fiber. 

The invention is also directed to an isolated nucleic acid comprising a post- 
transcriptional regulatory element (PRE) and a TPL. Preferably the PRE is the 
woodchuck hepatitis virus PRE (WPRE). 

The invention is further directed to a composition for preparing a therapeutic 
vector, said composition comprising a plasmid comprising an adenovirus genome 
lacking a nucleotide sequence encoding a fiber protein or a genome that is 
incapable of expressing sufficient fiber to result in packaging. 

Another aspect of the invention is directed to a method of delivering a 
heterologous gene to a human or any animal comprising providing an exogenous 
gene to a target cell comprising contacting said cell in vivo or ex vivo with an 
amount of a recombinant adenovirus particle sufficient to infect said cell. 

The invention is also directed to A method for producing a gutless adenoviral 
vector particle comprising: a) delivering a helper adenovirus vector genome to an 
adenovirus vector packaging cell, wherein said helper adenovirus vector genome 
lacks any gene encoding adenovirus fiber protein or lacks the ability to encode 
sufficient adenovirus fiber protein to produce an adenoviral vector comprising fiber 
protein in the absence of complemetation by said packing cell and wherein said 
packaging cell comprises the nucleic acid molecule of claim 2 operably linked to a 
promoter and to an adenoviral fiber protein or to a chimeric protein that includes an 
adenovirus fiber protein tail domain; (b) delivering a gutless adenovirus vector 
genome to said packaging cell; and (c) recovering the gutless adenoviral vector 
particle produced by said cell. 

Another aspect of the invention is directed to a helper adenovirus particle 
comprising an adenovirus vector genome that does not encode or does not 
express sufficient adenovirus fiber protein to support packaging of a fiber- 
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containing adenovirus particle without complementation of said fiber gene, wherein 
said genome has a mutation in its packaging sequence that renders said genome 
substantially incapable of being packaged. Packaging sequence are those 
sequences are those sequences involved in packaging the viral particle. 

The invention is further directed to a helper adenovirus particle comprising an 
adenovirus vector genome with recombinase sites flanking its packaging 
sequence, wherein said vector genome does not encode or does not express 
sufficient adenovims fiber protein to support packaging of a fiber-containing 
adenovirus particle without complementation of said fiber gene. 

The invention is also directed to an adenovirus particle comprising a gutless 
adenoviral vector genome and a fiberless capsid. as well as an adenovirus particle 
comprising a gutless adenoviral vector genome and a capsid comprising a modified 
fiber protein. 

Another aspect of the invention is directed to a packaging cell for the 
production of a fiberless or fiber-modified gutless adenovirus particle comprising an 
adenovirus vector complementing plasmid and a nucleotide sequence encoding a 
recombinase, wherein said complementing plasmid comprises the nucleic acid 
molecule of claim 2 operably linked to a promoter and to a nucleotide sequence 
encoding an adenoviral fiber protein or a chimeric adenoviral fiber protein. 
Preferably the cell line may comprise a recombinase. In an embodiment of the 
invention the recombinase may be Cre. 

In another embodiment of the invention, the fiber-deleted adenovirus vectors 
of the invention and the fiber-complementing adenovirus packaging cells of the 
invention are used to produce a gutless adenovirus vector particle. Such particle 
comprises a gutless adenoviral vector genome in an adenoviral capsid. The fiber 
proteins of the capsid may be wild-type fiber, or the modified fiber proteins 
disclosed herein. Alternatively, such particle may have a fiberless capsid as 
disclosed herein. Preferably, the gutless genome contains at least one 
heterologous gene as described herein. As used herein, the term "gutless 
adenoviral vector genome" means an adenoviral vector genome from which all of 
the viral genes have been deleted.. 

The invention also discloses systems or kits for use in any of the 
aforementioned methods. The systems or kits may contain any appropriate 
combination of the within-described vectors, plasmids, cell lines, virus particles and 
additional therapeutic agents as disclosed. Preferably, each such kit or system 
includes a quantity of the appropriate therapeutic substance or sequence sufficient 
for at least one administration, and instructions for administration and use. Thus, 
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one system further comprises an effective amount of a therapeutic agent which 
enhances the therapeutic effect of the therapeutic viral vector-containing 
composition. Another variation discloses that the composition and the therapeutic 
agent are each included in a separate receptacle or container. 

It will also be appreciated that any combination of the preceding elements 
may also be efficacious as described herein, and that all related methods are also 
within the scope of the present invention. 

< Figure 1 is a schematic diagram of the entire adenoviral E4 transcriptional 
unit with the open reading frames (ORF) indicated by blocked segments along with 
the promoter and terminator sequences. The location of primers for amplifying 
specific portions of E4 are also indicated as further described in Example 1 A. 

Figure 2 is a schematic map of plasmid pE4/Hygro as further described in 
Example 1B. 

Figure 3 is a schematic map of plasmid pCDNA3/Fiber as further described 
in Example IB. 

Figure 4 is a schematic map of plasmid pCLF as further described in 
Example 1B. 

Figure 5 is a photograph of a Southern blot showing the presence of Intact 
adenovirus E4 3.1 kilobase (kb) insert in the 211 cell line as further described in 
Example 1C. 

Figure 6 is an autoradiograph showing labeled fiber protein 
immunoprecipitated from cells and electrophoresed under native and denaturing 
electrophoresis conditions as described in Example 1C. The 293 cells lack fiber 
while the sublines 211 A, 21 IB and 21 1R contain fiber protein detectable in 
functional trimerized form and denatured monomeric form. 

Figure 7 is a schematic map of plasmid pDEX/EI as further described in 
Example ID. 

Figure 8 is a schematic map of plasmid pEI/Fiber as further described in 
Example 1F1. 

Figure 9 is a schematic map of plasmid pE4/Fiber as further described in 
Example 1F2). 

Figure 10 is a schematic illustration of linearized pD EIBb gal delivery 
plasmids for use in cotransfection and recombination to form a recombinant 
adenoviral vector having multiple adenoviral gene deletions. The plasmids and 
recombination event are more fully described in Example 2A. 
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Figure 11 is a schematic of plasmid pi 1.3 as further described in Example 
2A used in the construction of pDV44 delivery plasmid. 
Figure 12 is a schematic of plasmid 8.2. 

Figure 13 shows the trimeric structure of the recombinant fiber. 293, 211 A. 
21 IB, or 211 R cells as indicated were metaboiically labeled with [^S]methionine, 
soluble protein extracts prepared, and fiber was immunoprecipitated. A portion of 
the precipitated protein was electrophoresed on an 8% SDS-PAGE gel under either 
semi-native or denaturing conditions. The positions of trimeric (T) and monomeric 
(M) fiber are indicated. As a control for electrophoretic conditions, recombinant 
Ad2 fiber produced in baculovirus-infected cells was run under identical conditions 
and stained with Coomassie blue. Figure 14 shows the complementation of a 
fiber mutant adenovirus by fiber-producing cells. The cell lines indicated (2x10^ 
cells per sample) were infected with the temperature-sensitive fiber mutant 
adenovirus H5fs142 at 10 PFU/cell and incubated at either the pemnissive (32.5**C, 
stippled bars) or the restrictive (39.5°C, solid bars) temperature. 48 hours post- 
infection, virus was isolated by freeze-thaw lysis and yields determined by 
fluorescent focus assay on SW480 cells. Each value represents the mean of 
duplicate samples, and the data shown is representative of multiple experiments. 

Figure 15 shows the incorporation of the recombinant Ad5 fiber into Ad3 
particles. In Figure 15A, the alignment of the N-terminal (penton base-binding) 
domains of fiber proteins from several different adenovirus serotypes is shown. 
From top to bottom, the five different serotypes are listed as SEQ ID NOs 21-25. In 
Figure 15B, type 3 adenovirus was propagated in 293, 21 IB, or 21 1R cells as 
indicated and purified by two sequential CsCI centrifugations. 10 pg of the purified 
viral particles was then electrophoresed under denaturing conditions and 
transferred to a PVDF membrane. Ad5 fiber was detected with a polyclonal rabbit 
antibody raised against recombinant Ad2 fiber. As a positive control for detection, 
400 ng of wild-type Ad2 was run in the lane marked "Ad2". Under these conditions, 
the mobilities of the Ad2 and Ad5 fibers are indistinguishable and the antibody 
reacts with both proteins. 

Figure 16 shows the fiber deletion in pDV44 and the genomic structures of 
the AdS.pgaLAF and Ad5.pgal.wt vectors: Figure 16A shows pDV44 that was 
constructed by removing the fiber gene and residual E3 sequences (nt 
30819:32743 of ADS) from pBHGIO. Figures 16B shows viruses constructed by 
cotransfection of either pBHGIO or pDV44 with pAEIBPgal. Both are E1/E3 
deleted Ad5 vectors, and Ad5.pgal.AF has the additional fiber (L5) deletion as in 
pDV44. 



wo 00/42208 



PCT/EPOO/00265 



- 11- 

Figure 17 shows the analysis of the viral chromosomes. Figure 17A shows 
the predicted EcoRI restriction maps of Ad5.pgal.wt and Ad5.pgal.AF. The 5.9 kb 
fragment at the right end of the Ad5.pgal.wt genome is reduced to 4.0 kb by the 
deletion of fiber sequences in Ad5.pgal.AF. Figure 17B shows an ethidium 
bromide-stained gel of EcoRI-digested viral DNA. Figure 17C shows a Southern 
blot of the gel as described in Example 2 probed either with labeled fiber or E4 
sequences. 

Figure 18 shows the analysis of vertex proteins in the viral particles. 293 
(non-fiber expressing) or 21 1B (fiber-expressing) cells were infected with 
Ad5.pgal.wt (W) or with Ad5.pgal.AF(*AF') and the resulting viral particles were 
purified on CsCI gradients. 10 pg of purified virions was then electrophoresed on 
5-16% gradient gels and Western blotted. Proteins were detected with polyclonal 
anti-fiber or anti-penton base antibodies. 

Figure 19 shows the infectivity of Ad particles on THP-1 monocytic cells. 
Figure 19A shows THP-1 cells that were infected with Ad5.pgal.wt or with fiberless 
Ad5.pgal.AF at 100,000 particles/cell. Forty-eight hours after infection, cells were 
fixed and stained with X-gal and the fraction of infected cells was determined by 
light microscopy. Figure 19B shows cells that were infected with 1000 particles per 
cell of Ad5.pgal.wt or with 100,000 particles/cell of Ad5.pgal.AF. As indicated, cells 
were pretreated with 100 pg/ml of recombinant penton base or with 20 pg/ml of 
recombinant Ad2 fiber. 

Figure 20 shows a schematic of improved fiber-complementing cell lines, 
633 and 644 as further described in the Examples, 

Figures 21 and 22 illustrates pseudotyping of fiberless particles with fiber 
proteins and infectivity data as further described in the Examples. 

Figure 23 shows the Clal to Bglll fragment of Ad5. 

Figure 24 shows the plasmid pGRE5-2/EBV 

Figure 25 shows the plasmid pGRE5-E1 . 

Figure 26 shows the plasmid pSE280-E2 BamHI-Smal. 

Figure 27. The fiber-deleted adenovirus vector Ad5.bgal.DF was grown in 
cells expressing either no fiber (293; •Ad5.bgal.DF/0*), the Ad5 fiber (633; 
'Ad5.bgal.DF/5F'), or the Ad37 fiber with modifications as described in the text 
(705; *Ad5.bgal.DF/37F') and CsCI-purified. 10 pg of the purified particles were 
electrophoresed and transferred to a nylon membrane. As controls, 10 pg of wild- 
type Ad37 or the fiber gene-containing vector Ad5.bgal.wt or a sample of purified 
recombinant Ad37 fiber knob were also run. The blot was probed with polyclonal 
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antisera against recombinant Ad37 fiber or Ad2 fiber proteins. As a loading 
control, the same filter was reprobed with an antibody against the Ad2 penton base 
(the anti-Ad2 sera cross-recognized the very similar Ad5 fiber and Ad5 penton 
base proteins). 

Figure 28 shows PGR analysis for fiber presence. 

Figure 29 shows the transduction efficiency for fiberless virus with and 
without soluble fiber. 

Figure 30 shows the transduction efficiency of ADSBgF" on HDF cell line 
with the presence of different amounts of 633 conditioned media. 

To reduce the frequency of contamination with wild-type adenovirus, it is 
considered desirable to improve either the viral vector or the cell line to reduce the 
probability of recombination. For example, an adenovirus from a group with less 
homology to the group C viruses may be used to engineer recombinant viruses 
with little propensity for recombination with the Ad5 sequence in 293 cells. 
Similarly, an epithelial cell line - e.g. the cell line known as 293 - may be used or 
further modified according to within-disclosed methods which stably expresses 
adenovirus proteins or polypeptides from Ad3 and/or proteins or polypeptides from 
another non-group-C or group C serotype; such a cell line would be useful to 
support adenovirus-derived viral vectors bearing deletions of regulatory and/or 
structural genes, irrespective of the serotype from which such a vector was derived. 

It is also contemplated that the constructs and methods of the present 
invention will support the design and engineering of chimeric viral vectors which 
express amino acid residue sequences derived from two or more Ad serotypes. 
Thus, unlike methods and constructs available prior to the advent of the present 
disclosure, this invention allows the greatest possible flexibility in the design and 
preparation of useful viral vectors and cell lines which support their construction 
and propagation - all with a decreased risk of recombining with wild-type Ad to 
produce potentially-harmful recombinants. 

In part, the present invention discloses a simpler, alternative means of 
reducing the recombination between viral and cellular sequences than those 
discussed in the art. One such means is to increase the size of the deletion in the 
recombinant virus and thereby reduce the extent of shared sequences between 
that 

virus and any Ad genes present In a packaging cell line - e.g., the Ad5 genes in 
293 cells, or the various Ad genes in the novel cell lines of the present invention. 
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Therefore, the present invention makes it feasible to engineer and produce 
novel viral vectors that are able to package and deliver significantly larger foreign 
nucleic acid sequences for efficacious use In a variety of therapeutic applications, 
without endangering the subject to whom they are administered, due to their 
impaired ability to self-replicate in non-complementing cell lines. Due to the fact 
that "helper** viruses or plasmids need not be used in conjunction with many of the 
viral vectors of the present invention, those vectors of the present invention are 
also simpler to use than those previously described in the art. 

In order to provide a clearer understanding of the specification and claims, 
the following definitions are provided. 

Adenoviral Vector or Ad-Derived Vector Any adenovirus-derived plasmid, 
genome or virus into which a foreign DNA may be inserted or expressed. This term 
may also be used interchangeably with "viral vector." This "type" of vector may be 
utilized to carry nucleotide sequences encoding therapeutic proteins or 
polypeptides to specific cells or cell types in a subject in need of treatment, as 
described further herein below. 

Amino Acid Residue: An amino acid formed upon chemical digestion 
(hydrolysis) of a polypeptide at its peptide linkages. The amino acid residues 
described herein are preferably in the "L" isomeric form. However, residues in the 
"D" isomeric form can be substituted for any L-amino acid residue, as long as the 
desired functional property is retained by the polypeptide. NH2 refers to the free 
amino group present at the amino terminus of a polypeptide, COOH refers to the 
free carboxy group present at the carboxyl terminus of a polypeptide. Standard 
polypeptide nomenclature described in J. Biol, Chem., 243:3552-59 (1969) and 
adopted at 37 C.F.R. § § 1 .821 - 1 .822 is used. 

It should be noted that all amino acid residue sequences represented herein 
by formulae have a left to right orientation in the conventional direction of 
amino-terminus to carboxyl-terminus. In addition, the phrase "amino acid residue" 
is broadly defined to include the amino acids listed in the Table of Correspondence 
and modified and unusual amino acids, such as those referred to in 37 C.F.R. § § 
1.821-1.822, and incorporated herein by reference. Furthermore, It should be 
noted that a dash at the beginning or end of an amino acid residue sequence 
indicates a peptide bond to a further sequence of one or more amino acid residues 
or to an amino-terminal group such as NH2 or to a carboxyl-terminal group such as 



COOH. 
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Complementing Plasmid: This term is generally used herein to 
describe plasmid vectors used to deliver particular nucleotide sequences into a 
packaging cell line, with the intent of having said sequences stably integrate into 
the cellular genome. 

Delivery Plasmich This term is generally used herein to describe a 
plasmid vector that carries or delivers nucleotide sequences in or into a cell line 
{e.g., a packaging cell line) for the purpose of propagating therapeutic viral vectors 
of the present invention. 

DMA Homolog: A nucleic acid having a preselected conserved nucleotide 
sequence and a sequence encoding a preferred polypeptide according to the 
present invention, where the nucleic acid is substantial homologous to a named 
preferred embodiment. By the term "substantially homologous" is meant having at 
least 80%, preferably at least 90%, most preferably at least 95% homology 
therewith. 

The terms "homology" and "identity" are often used interchangeably. In this 
regard, percent homology or identity may be determined, for example, by 
comparing sequence information using a GAP computer program. The GAP 
program utilizes the alignment method of Needleman and Wunsch (J. MoL BioL 
48:443 (1970), as revised by Smith and Waterman {Adv. AppL Math. 2:482 (1981). 
Briefly, the GAP program defines similarity as the number of aligned symbols (I.e., 
nucleotides or amino acids) which are similar, divided by the total number of 
symbols in the shorter of the two sequences. The preferred default parameters for 
the GAP program may include: (1) a unary comparison matrix (containing a value 
of 1 for identities and 0 for non-identities) and the weighted comparison matrix of 
Gribskov and Burgess, Nucl. Acids Res. 14:6745 (1986), as described by Schwartz 
and Dayhoff, eds.. ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for each 
gap and an additional 0.10 penalty for each symbol in each gap; and (3) no penalty 
for end gaps. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least 80%, 85%, 90%, 95%, 96%. 97%, 98% or 99% "identical" can be 
determined using known computer algorithms such as the "FAST A" program, using 
for example, the default parameters as in Pearson and Lipman, Proc. Natl. Acad, 
ScL USA 85:2444 (1988). Altematively the BLAST function of the National Center 
for Biotechnology Information database may be used to determine identity 

In general, sequences are aligned so that the highest order match is 
obtained. "Identity" per se has an art-recognized meaning and can be calculated 
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using published techniques. (See, e.gr.: Computational Molecular Biology, Lesk, 
A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and 
Genome Projects, Smith, D,W., ed., Academic Press, New York, 1993; Computer 
Analysis of Sequence Data, Part /, Griffin, A.M., and Griffin, H.G., eds., Humana 
Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov. M. and 
Devereux, J., eds., M Stockton Press, New York, 1991). While there exist a 
number of methods to measure identity between two polynucleotide or polypeptide 
sequences, the term "identity" is well known to skilled artisans (Carillo, H. & Lipton, 
D.. SIAM J Applied Math 48:^073 (1988)). Methods commonly employed to 
determine identity or similarity between two sequences include, but are not limited 
to, those disclosed in Guide to Huge Computers, Martin J. Bishop, ed.. Academic 
Press, San Diego, 1994, and Carillo, H. & Lipton, D., SIAM J Applied Math 48:1073 
(1988). Methods to determine identity and similarity are codified in computer 
programs. Preferred computer program methods to determine identity and 
similarity between two sequences include, but are not limited to, GCG program 
package (Devereux, J., ef a/., Nucleic Acids Research 12(I)\3Q7 (1984)), BLASTP, 
BLASTN, FASTA (Atschul, S.F., etaL, J Molec Biol 215:403 (1990)). 

Therefore, as used herein, the term "identity" represents a comparison 
between a test and a reference polypeptide or polynucleotide. For example, a test 
polypeptide may be defined as any polypeptide that is 90% or more identical to a 
reference polypeptide. As used herein, the term at least "90% identical to" refers 
to percent identities from 90 to 99.99 relative to the reference polypeptides. 
Identity at a level of 90% or more is indicative of the fact that, assuming for 
exemplification purposes a test and reference polynucleotide length of 100 amino 
acids are compared. No more than 10% (i.e., 10 out of 100) amino acids in the test 
polypeptide differs from that of the reference polypeptides. Similar comparisons 
may be made between a test and reference polynucleotides. Such differences may 
be represented as point mutations randomly distributed over the entire length of an 
amino acid sequence or they may be clustered in one or more locations of varying 
length up to the maximum allowable, e.g. 10/100 amino acid difference 
(approximately 90% identity). Differences are defined as nucleic acid or amino 
acid substitutions, or deletions. 

An embodiment of the invention may use polynucleotides at least 90% or 
95% identical to those encoding the TPL nucleic acid sequences. A further 
embodiment of the invention may Include those polynucleotides that encode a 
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polypeptide of interest that are at least 95% identical when the variation in such a 
polynucleotide is due to more than merely degenerate changes. 

Expression or Delivery Vector: Any plasmid or virus into which a foreign 
DNA may be inserted for expression in a suitable host cell /.e., the protein or 
polypeptide encoded by the DNA is synthesized in the host cell's system. Vectors 
capable of directing the expression of DNA segments (genes) encoding one or 
more proteins are referred to herein as "expression vectors." Also included are 
vectors which allow cloning of cDNA (complementary DNA) from mRNAs produced 
using reverse transcriptase. 

Foreign Gene: This term is used to identify a DNA molecule not present in 
the exact orientation and position as the counterpart DNA molecule found in wild- 
type adenovirus. It may also refer to a DNA molecule from another organism or 
species {i.e., exogenous) or from another Ad serotype. 

Gene: A nucleic acid whose nucleotide sequence encodes an RNA or 
polypeptide. A gene can be either RNA or DNA. Genes may include regions 
preceding and following the coding region (leader and trailer) as well as 
intervening sequences (introns) between individual coding segments (exons). 

Isolated: This term is used to indicate a nucleic acid or polypeptide 
sequence separated from the genetic environment from which the sequences were 
obtained. It may also mean altered from the natural state. For example, a 
polynucleotide or a polypeptide naturally present in a living animal is not "isolated," 
but the same polynucleotide or polypeptide separated from the coexisting materials 
of its natural state is "isolated", as the term is employed herein. Thus, a 
polypeptide or polynucleotide produced and/or contained within a recombinant host 
cell is considered isolated for purposes of the present invention. Also intended as 
an "isolated polypeptide" or an "isolated polynucleotide" are polypeptides or 
polynucleotides that have been purified, partially or substantially, from a 
recombinant host cell or from a native source. For example, a recombinantly 
produced version of a compounds can be substantially purified by the one-step 
method described in Smith and Johnson, Gene 67.-31-40 (1988). The terms 
isolated and purified are sometimes used interchangeably. 

By "isolated" is meant that the DNA is free of the coding sequences of those 
genes that, in the naturally-occurring genome of the organism (if any) from which 
the DNA of the invention is derived, immediately flank the gene encoding the DNA 
of the invention. The isolated DNA may be single-stranded or double-stranded, 
and may be genomic DNA, cDNA, recombinant hybrid DNA, or synthetic DNA, It 
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may be identical to a native DNA sequence, or may differ from such sequence by 
the deletion, addition, or substitution of one or more nucleotides. 

Isolated or purified as it refers to preparations made from biological cells or 
hosts should be understood to mean any cell extract containing the indicated DNA 
or protein including a crude extract of the DNA or protein of interest. For example, 
in the case of a protein, a purified preparation can be obtained following an 
individual technique or a series of preparative or biochemical techniques and the 
DNA or protein of interest can be present at various degrees of purity in these 
preparations. The procedures may include for example, but are not limited to, 
ammonium sulfate fractionation, gel filtration, ion exchange change 
chromatography, affinity chromatography, density gradient centrifugation and 
electrophoresis. 

A preparation of DNA or protein that is "pure" or "isolated" should be 
understood to mean a preparation free from naturally occurring materials with 
which such DNA or protein is normally associated in nature. "Essentially pure" 
should be understood to mean a "highly" purified preparation that contains at least 
95% of the DNA or protein of interest, 

A cell extract that contains the DNA or protein of interest should be 
understood to mean a homogenate preparation or cell-free preparation obtained 
from cells that express the protein or contain the DNA of interest. The term "cell 
extract" is intended to include culture media, especially spent culture media from 
which the cells have been removed. 

Packaging Cell line: A packaging cell line is a cell line that provides a 
missing gene product or its equivalent. 

Particle: The adenovirus (Ad) particle is relatively complex and may be 
resolved into various substructures. The particle is the minimal structural or 
functional unit of a virus. A virus can refer to a single particle, a stock of particles 
or a viral genome. 

Penton: The terms "penton" or "penton complex" are preferentially used 
herein to designate a complex of penton base and fiber. The term "penton" may 
also be used to indicate penton base, as well as penton complex. The meaning of 
the term "penton" alone should be clear from the context within which it is used. 
Plasmid, An autonomous self-replicating extrachromosomal circular DNA 
Post-transcriptional Regulatory Element (PRE) is a regulatory element found 
in viral or cellular messenger RNA that is not spliced, i.e. intronless messages. 
Examples include, but are not limited to, human hepatitis virus, woodchuck 
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hepatitis virus, the TK gene and mouse histone gene. The PRE may be placed 
before a polyA sequence and after a heterologous DNA sequence. 

Pseudotyping: This term as generally used herein describes the production 
of adenoviral vectors having modified capsid protein or capsid proteins from a 
different serotype than the serotype of the vector itself. One example, is the 
production of an adenovirus 5 vector particle containing a chimeric Ad3/Ad5 fiber 
protein. This may be accomplished by producing the adenoviral vector in 
packaging cell lines expressing different fiber proteins. 

Promoter: Useful promoters according to the present invention may be 
inducible or constitutive. Inducible promoters will initiate transcription only in the 
presence of an additional molecule; constitutive promoters, on the other hand, do 
not require the presence of any additional molecule to regulate gene expression. 
A regulatable or inducible promoter may also be described as a promoter wherein 
the rate of RNA polymerase binding and initiation is modulated by external stimuli. 
Such stimuli include various compounds or compositions, light, heat, stress, 
chemical energy sources, and the like. Inducible, suppressive and repressible 
promoters are considered regulatable promoters. 

Receptor: Receptor is a term used herein to indicate a biologically active 
molecule that specifically binds to (or with) other molecules. The term "receptor 
protein" may be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

Recombinant As used herein, the temn is intended to refer to any progeny 
formed as the result of genetic engineering. This may also be used to describe a 
virus formed by recombination of plasmids in a packaging cell. 

Transgene or Therapeutic Nucleotide Sequence: As described and claimed 
herein, such a sequence includes DNA and RNA sequences encoding an RNA or 
polypeptide. Such sequences may be "native" or naturally-derived sequences; 
they may also be "non-native" or "foreign" sequences which are naturally- or 
recombinantly-derived. The term "transgene," which may be used interchangeably 
herein with the term '^therapeutic nucleotide sequence," is often used to describe a 
heterologous or foreign (exogenous) gene that is carried by a viral vector and 
transduced Into a host cell. 

Therefore, therapeutic nucleotide sequences may also include antisense 
sequences or nucleotide sequences which may be transcribed into antisense 
sequences. Therapeutic nucleotide sequences (or transgenes) further comprise 
sequences which function to produce a desired effect In the cell or cell nucleus 
into which said therapeutic sequences are delivered. For example, a therapeutic 
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nucleotide sequence may encode a functional protein intended for delivery into a 
cell which is unable to produce that functional protein. 

Adenovirus 

Fiber plays a crucial role in adenovirus infection by attaching the virus to a 
specific receptor on the cell surface. The fiber is an elongated protein which 
exists as a trimer of three identical polypeptides (polypeptide IV) of 582 amino 
acids in length. An adenovirus fiber consists of three domains: an N-terminal tail 
domain that interacts with penton base; a shaft composed of variable numbers of 
repeats of a 15-amino-acld segment that forms beta-sheet and beta-bends; and a 
knob at the C-terminus ("head domain") that contains the type-specific antigen and 
is responsible for binding to the cell surface receptor. The gene encoding the 
fiber protein from Ad2 has been expressed in human cells and has been shown to 
be correctly assembled into trimers, glycosylated and transported to the nucleus. 
(See, e.g.. Hong and Engler, Virology 185: 758-761, 1991). Thus, alteration of the 
fiber in recombinant Ad vectors can lead to alteration in gene delivery. This has 
great utility for a variety of gene therapy applications and is one of the objects of 
the present invention. 

Hexon, penton and fiber capsomeres are the major components on the 
surface of the virion. Their constituent polypeptides, nos. II, III and IV. contain 
tyrosine residues that are exposed on the surface of the virion and can be labeled 
- e.g., by iodination of intact particles. 

The 35,000+ base pair (bp) genome of adenovirus type 2 has been 
sequenced and the predicted amino acid sequences of the major coat proteins 
(hexon, fiber and penton base) have been described. (See, e.g., Neumann et aL, 
Gene 69: 153-157 (1988); Herisse et aL, Nuc. Acids Res, 9: 4023-4041 (1981); 
Roberts et aL, J. BioL Oiem. 259: 13968-13975 (1984); Kinloch et aL, J. BioL 
Chem. 259: 6431-6436 (1984); and Chroboczek et aL, ViroL 161: 549-554, 1987). 

The sequence of Ad5 DNA was completed more recently; its sequence 
includes a total of 35,935 bp. Portions of many other adenovirus genomes have 
also been sequenced. It is presently understood that the upper packaging limit for 
adenovirus virions is about 105% of the wild-type genome length. (See, e.g., Bett, 
etaL, J. ViroL 67(10): 59^^•2^, 1993). Thus, for Ad2 and Ad5, this would be an 
upper packaging limit of about 38kb of DNA. 

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs) 
ranging in size from about 100 to 150 bp, depending on the serotype. The 
inverted repeats enable single strands of viral DNA to circularize by base-pairing 
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of their terminal sequences, and the resulting base-paired "panhandle" structures 
required for replication of the viral DNA. 

For efficient packaging, the ITRs and the packaging signal (a few hundred 
bp in length) comprise the "minimum requirement" for replication and packaging of 
a genomic nucleic acid Into an adenovirus particle. Helper-dependent vectors 
lacking all viral ORFs but including these essential cis elements (the ITRs and 
contiguous packaging sequence) have been constructed, but the virions package 
less efficiently that the helper and package as multimers part of the time, which 
suggests that the virus may "want" to package a fuller DNA complement (see. e.g., 
Fisher. etaL, Virology 217: 11-22, 1996). 

The viral vectors of the present invention may retain their ability to express 
the genome packaged within - Ae., they may retain their "infectivity" - they do not 
act as infectious agents, however, to the extent that they cause disease in the 
subjects to whom they are administered for therapeutic purposes. 

It is to be appreciated that Ad vectors have several distinct advantages 
over other viral vectors in the art. For example, recombination of such vectors is 
rare; there are no known associations of human malignancies with adenoviral 
infections despite common human infection with adenoviruses; the genome may 
be manipulated to accommodate foreign genes of a fairly substantial size; and 
host proliferation is not required for expression of adenoviral proteins. 

An extension of this invention is that the Ad-derived viral vectors disclosed 
herein may be used to target and deliver genes into specific cells by incorporating 
the attachment sequence for other receptors (such as CD4) onto the fiber protein 
by recombinant DNA techniques, thus producing a chimeric rholecule. This should 
result in the ability to target and deliver genes into a wide range of cell types with 
the advantage of evading recognition by the host's immune system. The within- 
disclosed delivery systems thus provide for increased flexibility in gene design to 
enable gene delivery into proliferating and nonproliferating cell types. 

For example, U.S. Patent Nos. 5,756,086 and 5,543.328 as well as, 
W095/26412 and WO 98/44121 and Krasnykh, et al. (J. ViroL 70: 6839-46. 1996) 
describe modifications that may be made to the adenovirus fiber protein. Such 
modifications are useful in altering the targeting mechanism and specificity of 
adenovirus and could readily be utilized in conjunction with the constructs of the 
present invention to target the novel viral vectors disclosed herein to different 
receptors and different cells. Moreover, modifications to fiber protein which alter 
its tropism may permit greater control over the localization of viral vectors in 
therapeutic applications. 
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Similarly, incorporation of various structural proteins into cell lines of the 
present invention, whether or not those proteins are modified, is also 
contemplated by the present invention. Thus, for example, modified, penton base 
polypeptides such as those described in Wickham, et aL {J. ViroL 70. 6831-8, 
1996) may have therapeutic utility when used according to the within-disclosed 
methods. 

While some of the Examples appearing below specifically recite fiber 
proteins, polypeptides, and fragments thereof, it is expressly provided herein that 
other structural and non-structural Ad proteins and polypeptides (e.p,, regulatory 
proteins and polypeptides) may be used as components of the various disclosed 
vectors and cell lines. Moreover, chimeric molecules comprised of proteins, 
polypeptides, and/or fragments thereof which are derived from different Ad 
serotypes may be used in any of the within-disclosed methods, constructs and 
compositions. Similariy, recombinant DNA sequences of the present invention 
may be prepared using nucleic acid sequences derived from different Ad 
serotypes, in order to design useful constructs with broad applicability, as 
disclosed and claimed herein. 

It should also be appreciated that, while the members of Group C or Group 
D adenovirus - /.e., Ad serotypes 1, 2, 5, 6 or 37 - are specifically recited in 
various examples herein, the present invention is in no way limited to those 
serotypes alone. In view of the fact that the adenovirus serotypes are all closely- 
related in structure and functionality, therapeutic viral vectors, packaging cell 
lines, and plasmids of the present invention may be constructed from components 
of any and all Ad serotypes - and the within-disclosed methods of making and 
using the various constructs and cell lines of the present invention apply to all of 
said serotypes. 

The family of Adenoviridae includes many members with at least 47 known 
serotypes of human adenovirus (Ad1-Ad47) (Shenk, Virology, Chapter 67, in 
Fields et aL, eds. Lippincott-Raven, Philadelphia, 1996.) as well as members of 
the genus Mastadenovirus including human, simian, bovine, equine, porcine, 
ovine, canine and opossum viruses, and members of the Aviadenovirus genus, 
including bird viruses, e.g. CELO. Thus it is contemplated that the disclosed 
inventions can be applied to any adenovirus species, and the invention need not 
be so limited. One of skill in the art would have knowledge of the different 
adenoviruses as evidenced by (Shenk, Virology, Chapter 67, in Fields et aL, eds. 
Lippincott-Raven, Philadelphia, 1996,) which is herein incorporated by reference. 
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Packaging Cell Lines A. Adenovirus Complementation Genetics 

The first generation of recombinant adenoviral vectors currently available 
typically have a deletion in the first viral early gene region which is generally 
referred to as El, which comprises the E1a and E1b regions. (These regions 
span genetic map units 1 .30 to 9.24.) Figure 3 in chapter 67 of Fields Virology, 
3d Ed. (Fields et aL eds, Lippincott-Raven Publ., Philadelphia, 1996, p. 2116) 
illustrates a transcription and translation map of adenovirus type 2 (Ad2). 

Deletion of the viral El region renders the recombinant adenovirus 
defective for replication and incapable of producing infectious viral particles in 
subsequently-infected target cells. Thus, to generate E1 -deleted adenovirus 
genome replication and to produce virus particles requires a system of 
complementation which provides the missing El gene product. El 
complementation is typically provided by a cell line expressing El , such as the 
human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 293. 
Cell line 293 contains the El region of adenovirus, which provides El gene region 
products to "support" the growth of El -deleted virus in the cell line (see, e.g., 
Graham et aL, J, Gen, Virol. 36: 59-71, 1977). Additionally, cell lines that may be 
usable for production of defective adenovirus having a portion of the adenovirus 
E4 region have been reported (WO 96/22378). Multiply deficient adenoviral 
vectors and complementing cell lines have also been described (WO 95/34671, 
U.S. Patent No. 5,994,106). Nevertheless, inherent problems exist conceming 
first-generation recombinant adenoviruses. 

B. Adenovirus Particle Packaging Cell Lines 

Packaging cell lines disclosed herein support viral vectors with deletions of 
major portions of the viral genome, without the need for helper viruses. 
Additionally, the invention provides novel cell lines and helper viruses for use with 
helper-dependent vectors. 

Thus, in one embodiment of the present invention, a packaging cell line is 
disclosed having DNA sequences stably integrated into the cellular genome 
wherein the DNA sequences encode one or more adenovirus regulatory and/or 
structural polypeptides which complement the genes deleted or mutated in the 
adenovirus vector genome to be replicated and packaged. 

In another embodiment, the packaging cell line expresses one or more 
adenovirus structural proteins, polypeptides, or fragments thereof, wherein said 
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structural protein is selected from the group consisting of penton base, hexon. 
fiber, polypeptide Ilia, polypeptide V, polypeptide VI, polypeptide VII, polypeptide 
VIII, and biologically active fragments thereof. 

In one variation, the sequences are constitutively expressed; in another, 
one or more sequences is under the control of a regulatable promoter. In a 
preferred embodiment expression is constitutive. In various preferred 
embodiments, the polypeptides expressed by the DNA sequences are biologically 
active. 

In a further and preferred embodiment the packaging cell line of the present 
invention supports the production of a viral vector. In a preferred embodiment the 
viral vector is a therapeutic vector. 

The present invention also discloses a packaging cell line which 
complements a viral vector having a deletion or mutation of a DNA sequence 
encoding an adenovirus structural protein, regulatory polypeptides El A and E1B, 
and/or one or more of the following regulatory proteins or polypeptides: E2A, 
E2B, E3, E4, L4, or fragments thereof. 

Various useful packaging cells are contemplated which complement 
adenovirus. In one aspect of the present invention, each DNA sequence is 
introduced into the genome of the within-disclosed cell lines via a separate 
complementing plasmid. In other embodiments, two or more DNA sequences 
were introduced into the genome via a single complementing plasmid. In one 
variation, the complementing plasmid comprises a DNA sequence encoding 
adenovirus fiber protein, polypeptide or fragment thereof. An example of a useful 
complementing plasmid according to the present invention is a plasmid having the 
characteristics of pCLF (for deposit details, see Example 3) 

One embodiment discloses a packaging cell useful in the preparation of 
recombinant adenovirus viral vectors comprising a delivery plasmid comprising an 
adenovirus genome lacking a nucleotide sequence encoding fiber. in one 
variation, the delivery plasmid further comprises a nucleotide sequence encoding 
a foreign polypeptide. A preferred delivery plasmid is pDV44, pEI B gal, or 
pElsplB. In another variation, the cell further comprises a complementing 
plasmid containing a nucleotide sequence encoding fiber, the plasmid being stably 
integrated into the cellular genome of the cell. 

In one embodiment, a composition comprises a cell containing first and 
second delivery plasmids wherein a first delivery plasmid comprises an 
adenovirus genome lacking a nucleotide sequence encoding fiber and incapable 
of directing the packaging of new viral particles in the absence of a second 
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delivery plasmid. and a second delivery plasmid comprises an adenoviral genome 
capable of directing the packaging of new virai particles in the presence of the first 
delivery plasmid. 

In another variation, the first and second delivery plasmids interact within 
the cell to produce a therapeutic viral vector. In yet another variation, the cell 
further comprises a complementing plasmid containing a nucleotide sequence 
encoding fiber, the plasmid being stably integrated into the cellular genome of the 
cell. In still another, the first or second delivery plasmid further comprises a 
nucleotide sequence encoding a foreign polypeptide, in various embodiments, the 
polypeptide is a therapeutic molecule. 

Another embodiment discloses a composition as before, wherein the first 
delivery plasmid lacks adenovirus packaging signal sequences. In another 
aspect, the second delivery plasmid contains a LacZ reporter construct. In 
another variation, the second delivery plasmid further lacks a nucleotide sequence 
encoding an adenovirus regulatory protein. In one variation, the regulatory protein 
is E1. In one embodiment of the above-noted compositions, the complementing 
plasmid has the characteristics of pCLF. 

In another embodiment, a composition is disclosed wherein the first delivery 
plasmid lacks a nucleotide sequence encoding an adenovirus structural protein 
and the second delivery plasmid lacks a nucleotide sequence encoding 
adenovirus E1 protein. In another, the first delivery plasmid lacks a nucleotide 
sequence encoding adenovirus E4 protein and the second delivery plasmid lacks 
a nucleotide sequence encoding adenovirus E1 protein. In still another, the cell 
contains at least one complementing plasmid encoding an adenoviral regulatory 
protein and a structural protein. 

In one preferred variation of the present Invention, a packaging cell line 
expresses fiber protein. In one embodiment, the fiber protein has been modified 
to include a non-native amino acid residue sequence which targets a specific 
receptor, but which does not disrupt trimer formation or transport of fiber into the 
nucleus. In another variation, the non-native amino acid residue sequence alters 
the binding specificity of the fiber for a targeted cell type. In still another 
embodiment, the structural protein is fiber comprising amino acid residue 
sequences from more than one adenovirus serotype. As disclosed herein, the. 
nucleotide sequences encoding fiber protein or polypeptide need not be modified 
solely at one or both termini; fiber protein - and indeed, any of the adenoviais 
structural proteins, as taught herein ~ may be modified "internally" as well as at 
the termini. 
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In one variation, the non-native amino acid residue sequence is coupled to 
the carboxyl terminus of the fiber. In yet another, the non-native amino acid 
residue sequence further includes a linker sequence. Alternatively, the fiber 
protein further comprises a ligand coupled to the linker. A suitable ligand may be 
selected from the group consisting of ligands that specifically bind to a cell surface 
receptor and ligands that can be used to couple other proteins or nucleic acid 
molecules. Typically, any of the packaging cell lines of this invention may have a 
DNA sequence encoding all or part of a fiber protein ~ including modified or 
chimeric proteins — stably integrated into the genome. 

In various aspects of the present invention, a packaging cell line of the 
present invention is derived from a procaryotic cell line; in another, it Is derived 
from a eucaryotic cell line. While various embodiments suggest the use of 
mammalian cells, and more particularly, epithelial cell lines, a variety of other, 
non-epithelial cell lines are used in various embodiments. Thus, while various 
embodiments disclose the use of a cell line selected from the group consisting of 
293, A549. W162, HeLa, Vero, 211, and 211 A cell lines, it is understood that 
various other cell lines are likewise contemplated for use as disclosed herein. 

Therapeutic Viral Vectors and Related Systems 

A. Nucleic Acid Segments 

A therapeutic viral vector or composition of the present invention comprises 
a nucleotide sequence, nucleic acid molecule or segment as described herein. 
Typically, the nucleic acid molecule or molecule encodes a protein or polypeptide 
molecule — or a biologically active fragment thereof — which may be used for 
therapeutic applications. A nucleotide sequence may further comprise an 
enhancer element or a promoter located 3* or 5' to and controlling the expression of 
such a therapeutic nucleotide sequence or gene. 

A subject nucleotide sequence consists of a nucleic acid molecule that 
comprises at least 2 different operatively linked DNA segments. The DNA can be 
manipulated and amplified by PGR as described herein and by using standard 
techniques, such as those described in Molecular Cloning: A Laboratory Manual, 
2nd Ed,, Sambrook et aL, eds., Cold Spring Harbor, New York (1989). Typically, to 
produce a nucleotide sequence of the present invention, the sequence encoding 
the selected polypeptide and the promoter or enhancer are operatively linked to a 
DNA molecule capable of autonomous replication in a cell either in vivo or in vitro. 
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By operatively linking the enhancer element or pronnoter and the nucleotide 
sequence to the vector, the attached segments are replicated along with the vector 
sequences. 

Thus, a recombinant DNA molecule (rDNA) of the present invention is a 
hybrid DNA molecule comprising at least 2 nucleotide sequences not normally 
found together in nature. In various preferred embodiments, one of the sequences 
is a sequence encoding an Ad-derived polypeptide, protein, or fragment thereof. 
Stated another way, a nucleotide sequence of the present invention is one that 
encodes an expressible protein, polypeptide or fragment thereof, and it may further 
include an active constitutive or regulatable {e.g. inducible) promoter sequence. 

A nucleotide sequence of the present invention is optimally from about 20 
base pairs to about 40,000 base pairs in length. Preferably the nucleotide 
sequence is from about 50 bp to about 38,000 bp in length. In various 
embodiments, the nucleotide sequence is of sufficient length to encode one or 
more adenovirus proteins or functional polypeptide portions thereof. Since 
individual Ad polypeptides vary in length from about 19 amino acid residues to 
about 967 amino acid residues, corresponding nucleotide sequences will range 
from about 50 bp up to about 3000 bp, depending on the number and size of 
individual polypeptide-encoding sequences that are "replaced" in the viral vectors 
by therapeutic nucleotide sequences of the present invention. 



In one aspect of the invention, it has been discovered that expression of 
adenovirus late proteins such as the structural proteins in a packaging cell line 
according to the present invention is substantially improved when the expression 
cassette present on the complementing plasmid or in the packaging cell line's 
genome contains an adenovirus tripartite leader (TPL) nucleic acid sequence. 

Thus, the invention contemplates a nucleic acid molecule comprising a TPL 
nucleotide sequence. Preferably, the TPL nucleotide sequence may be operatively 
linked to an intron containing RNA processing signals (such as for example, splice 
donor or splice acceptor sites) suitable for expression in the packaging cell line. 
Most preferably, the intron contains a splice donor site and a splice acceptor site. 
Alternatively, the TPL nucleotide sequence may not comprise an intron. 

In one embodiment, a subject nucleic acid molecule of this invention is 
isolated, /.e., separated from the genetic environment from which the component 
sequences were obtained. Thus, molecular cloning of fragments of a gene will 
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produce an isolated nucleic acid, as will the chemical synthesis of an 
oligonucleotide to build a nucleic acid molecule. 

The intron useful in the present invention is any nucleotide sequence which 
functions in the packaging cell line to provide RNA processing signals, as are well 
known in the art, including splicing signals. Introns have been well characterized 
from a large number of structural genes, and therefor the invention should not be 
considered as limited. Well characterized and preferred introns include a native 
intron 1 from adenovirus, such as AdS's TPL intron 1 ; others include the SV40 VP 
intron; the rabbit beta-globin intron, and synthetic intron constructs. See, for 
example, Petitclerc et aL, J. BiothechnoL, 40:169, 1995, and Choi et aL, MoL Cell. 
BioL, 7 7:3070, 1991. 

The TPL nucleotide sequence comprises either (a) first and second TPL 
exons or (b) first, second and third TPL exons, where each TPL exon in the 
sequence is selected from the group consisting of complete TPL exon 1, partial 
TPL exon 1 , complete TPL exon 2 and complete TPL exon 3. A complete exon is 
one which contains the complete nucleic acid sequence based on the sequence 
found in the wild type viral genome. Preferably the TPL exons are from Ad2, Ad3, 
Ad5, Ad7 and the like, however, they may come from any Ad serotype, as 
described herein. A preferred partial TPL exon 1 is described in the Examples. 
The use of a TPL with a partial exon 1 has been reported (W098/13499). 

The intron and the TPL exons can be operatively linked in a variety of 
configurations to provide a functional TPL nucleotide sequence, although in some 
embodiments of the invention, an intron may not be a part of the construct. For 
example, the intron can be positioned between any of TPL exons 1, 2 or 3, and the 
exons can be in any order of first and second, or first/second/third. The intron can 
also be placed preceding the first TPL exon or following the last TPL exon. In a 
preferred embodiment, complete TPL exon 1 is operatively linked to complete TPL 
exon 2 operatively linked to complete TPL exon 3. In a preferred variation, 
adenovirus TPL Intron 1 is positioned between complete TPL exon 1 and complete 
TPL exon 2. It may also be possible to use analogous translational regulators from 
other viral systems such as rabiesvirus. 

A preferred "complete" TPL nucleic acid molecule containing complete TPL 
exons 1 , 2 and 3 with adenovirus intron 1 inserted between exons 1 and 2 has a 
nucleotide sequence shown in SEQ ID NO: 32. A preferred "partial" TPL nucleic 
acid molecule containing partial TPL exon 1 and complete TPL exons 2 and 3 in 
that order has a nucleotide sequence shown in SEQ ID NO: 26. The construction 
of these preferred TPL nucleotide sequences is described in the Examples. 
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Thus, preferred expression cassettes and complementing plasmids for 
expressing adenovirus structural genes, particularly fiber protein, contain an 
adenovirus TPL nucleotide sequence as described herein. Preferred packaging 
cell lines containing the subject nucleic acid molecules also contain a TPL 
nucleotide sequence of the invention. 



The invention describes in a related embodiment nucleic acid molecules and 
nucleotide sequences, typically in the form of DNA plasmid vectors, which are 
capable of expression of an adenovirus structural protein or regulatory protein. 
Because these expression plasmids are used to complement the defective genes 
of a recombinant adenovirus vector genome of this invention, the plasmids are 
referred to as complementing or complementation plasmids. 

The complementing plasmid contains an expression cassette, a nucleotide 
sequence capable of expressing a protein product off the gene encoded by the 
nucleotide sequence. Expression cassettes are described in more detail herein, 
but typically contain a promoter and a structural gene operatively linked to the 
promoter and whose expression is controlled by the promoter. A preferred 
complementing plasmid further includes a TPL nucleotide sequence described 
herein to enhance expression of the structural gene product when used in the 
context of adenovirus genome replication and packaging. 

In one embodiment, a complementing plasmid comprises a promoter 
nucleotide sequence operatively linked to a nucleotide sequence encoding an 
adenovirus structural polypeptide. The adenovirus structural polypeptide is 
selected from the group consisting of penton base; hexon; fiber; polypeptide Ilia; 
polypeptide V; polypeptide VI; polypeptide VII; polypeptide VIII; and biologically 
active fragments thereof. In another variation, a complementing plasmid further 
comprises a nucleotide sequence encoding a first adenovirus regulatory 
polypeptide, a nucleotide sequence encoding a second regulatory polypeptide, a 
nucleotide sequence encoding a third regulatory polypeptide; or any combination 
of the foregoing. 

The present invention also discloses a complementing plasmid comprising a 
promoter nucleotide sequence operatively linked to a nucleotide sequence 
encoding an adenovirus structural protein, polypeptide or fragment thereof and a 
nucleotide sequence encoding an adenovirus regulatory protein, polypeptide or 
fragment thereof. In one variation, the early region polypeptide is E4; in another, 
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the plasmid comprises pE4/Hygro. In still another variation, the early region 
polypeptides are El and E4. 

In another aspect of the present invention, the complennenting plasmid used 
to transform a cell line of the present invention further comprises a DNA sequence 
encoding an adenovirus regulatory protein, polypeptide or fragment thereof. In one 
variation, the regulatory protein is selected from the group consisting of E1A, E1B, 
E2A, E2B, E3, E4 and L4 (also referred to as "the 100K protein"); an exemplary 
complementing plasmid has the characteristics of is pE4/Hygro (for deposit details, 
see the Examples). In another aspect, the complementing plasmid used to 
transform a cell line of the present invention further comprises a DNA sequence 
encoding two or more of the above mentioned adenovirus regulatory proteins, 
polypeptides or fragments thereof. 

Preferred complementing plasmids include pCLF, pDV60, pDV61, pDV67, 
pDV69. pDV80, pDV90 and the like plasmids described in the Examples. The 
nucleic acid sequence of particularly preferred complementing plasmids are shown 
in SEQ ID NO: 43 for pDV60, SEQ ID NO: 44 for pDV67, SEQ ID NO: 47 for 
pDV69, SEQ ID NO: 64 for pDV80 and SEQ ID NO: 65 for pDV90. 

3. Nucleic Acid Molecule Synthesis 

A nucleic acid molecule comprising synthetic oligonucleotide sequences of 
the present invention can be prepared using any suitable method, such as, the 
phosphotriester or phosphodiester methods. See Narang et aL, Meth, EnzymoL, 
68:90, (1979); U.S. Patent No. 4,356,270; and Brown et aL, Meth. EnzymoL, 
68:109, (1979). 

For oligonucleotide sequences in which a family of variants is preferred, the 
synthesis of the family members can be conducted simultaneously in a single 
reaction vessel, or can be synthesized independently and later admixed in 
preselected molar ratios. For simultaneous synthesis, the nucleotide residues that 
are conserved at preselected positions of the sequence of the family member can 
be introduced in a chemical synthesis protocol simultaneously to the variants by 
the addition of a single preselected nucleotide precursor to the solid phase 
oligonucleotide reaction admixture when that position number of the 
oligonucleotide is being chemically added to the growing oligonucleotide polymer. 
The addition of nucleotide residues to those positions in the sequence that vary 
can be introduced simultaneously by the addition of amounts, preferably equimolar 
amounts, of multiple preselected nucleotide precursors to the solid phase 
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ollgonucleotide reaction admixture during chemical syntliesis. For example, where 
all four possible natural nucleotides (A.T.G and C) are to be added at a 
preselected position, their precursors are added to the oligonucleotide synthesis 
reaction at that step to simultaneously form four variants. 

This manner of simultaneous synthesis of a family of related 
oligonucleotides has been previously described for the preparation of "Degenerate 
Oligonucleotides" by Ausubel et al. (Current Protocols in Molecular Biology, Suppl. 
8. p.2.11.7, John Wiley & Sons, Inc., New York .1991). and can readily be applied 
to the preparation of the therapeutic oligonucleotide compositions described 
herein. 

Nucleotide bases other than the common four nucleotides (A.T.G or C), or 
the RNA equivalent nucleotide uracil (U), can also be used in the present 
invention. For example, it is well known that inosine (I) is capable of hybridizing 
with A, T and G. but not C. Examples of other useful nucleotide analogs are known 
in the art and may be found referred to in 37 C.F.R. §1 .822. 

Thus, where all four common nucleotides are to occupy a single position of 
a family of oligonucleotides, that is, where the preselected nucleotide sequence is 
designed to contain oligonucleotides that can hybridize to four sequences that vary 
at one position, several different oligonucleotide structures are contemplated. The 
composition can contain four members, where a preselected position contains 
A,T,G or C. Alternatively, a composition can contain two nucleotide sequence 
members, where a preselected position contains I or C, and has the capacity the 
hybridize at that position to all four possible common nucleotides. Finally, other 
nucleotides may be included at the preselected position that have the capacity to 
hybridize in a non-destabilizing manner with more than one of the common 
nucleotides rn a manner similar to inosine. 

Similarly, larger nucleic acid molecules can be constructed in synthetic 
oligonucleotide pieces, and assembled by complementary hybridization and 
ligation, as is well known. 

B. Adenovirus Expression Vector Systems 

One key component of the present invention for producing gene therapy 
reagents comprised of recombinant adenovirus particles is the recombinant 
adenovirus vector genome which is encapsulated in the virus particle and which 
expresses exogenous genes in a gene therapy setting. Thus, the components of 
an recombinant adenovirus vector genome include the ability to express selected 
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adenovirus structural genes, to express a desired exogenous protein, and to 
contain sufficient replication and packaging signals that the genome is packaged 
into a gene delivery vector particle. The preferred replication signal is an 
adenovirus inverted terminal repeat containing an adenovirus origin of replication, 
as is well known and described herein. 

According to the present invention, a preferred recombinant adenovirus 
vector genome is "helper independent" which means the genome can replicate and 
be packaged without the help of a second, complementing helper virus. Instead, 
the complementation is provided by a packaging cell line of the present invention. 
Additional embodiments of the invention, however, are drawn to a vector genome 
referred to as "gutless" which is "helper dependent." 

In a preferred embodiment, the adenovirus vector genome does not encode 
a functional adenovirus fiber protein. A non-functional fiber gene refers to a 
deletion, mutation or other modification to the adenovirus fiber gene such that the 
gene does not express any or insufficient adenovirus fiber protein to package a 
fiber-containing adenovirus particle without complementation of the fiber gene by a 
complementing plasmid or packaging cell line. Such a genome is referred to as a 
Tiberless" genome, not to be confused with a fiberless particle. Alternatively, a 
fiber protein may be encoded but is insufficiently expressed to result in a fiber 
containing particle. 

Thus, the invention describes a helper-independent fiberless recombinant 
adenovirus vector genome comprising genes which (a) express all adenovirus 
structural gene products but express insufficient adenovirus fiber protein to 
package a fiber-containing adenovirus particle without complementation of said 
fiber gene, (b) express an exogenous protein, and (c) contains an adenovirus 
packaging signal and inverted terminal repeats containing adenovirus origin of 
replication. 

The introduction of exogenous DNA into eucaryotic cells has become one of 
the most powerful tools of the molecular biologist. The term "exogenous" 
encompasses any therapeutic composition of this invention which is administered 
by the therapeutic methods of this invention. Thus, "exogenous" may also be 
referred to herein as "foreign," "non-native," and the like. The methods of this 
invention preferably require efficient delivery of the DNA into the nucleus of the 
recipient cell and subsequent identification of cells that are expressing the foreign 
DNA. 

The adenovirus vector genome is propagated in the laboratory in the form of 
rDNA plasmids containing the genome, and upon introduction into an appropriate 
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host, the viral genetic elements provide for viral genome replication and packaging 
rather than plasmid-based propagation. Exemplary methods for preparing an Ad- 
vector genome are described in the Examples. 

A widely-used plasmid is pBR322. a vector whose nucleotide sequence and 
endonuclease cleavage sites are well known. Various other useful plasmid 
vectors are described in the Examples that follow. 

A nucleic acid vector of the present invention comprises a nucleic acid 
(preferably DNA) molecule capable of autonomous replication in a cell and to 
which a DNA segment, e,g., a gene or polynucleotide, can be operatively linked so 
as to bring about replication of the attached segment. In the present invention, one 
of the nucleotide segments to be operatively linked to vector sequences encodes at 
least a portion of a therapeutic nucleic acid molecule ~ in effect, a nucleic acid 
sequence that encodes one or more therapeutic proteins or polypeptides, or 
fragments thereof. 

As one of skill in the art will note, in various embodiments of the present 
invention, different "types" of vectors are disclosed. For example, one "type" of 
vector is used to deliver particular nucleotide sequences into a packaging cell line, 
with the intent of having said sequences stably integrate into the cellular genome; 
these "types" of vectors are generally identified herein as complementing plasmids. 
A further "type" of vector described herein carries or delivers nucleotide sequences 
in or into a cell line {e,g., a packaging cell line) for the purpose of propagating 
therapeutic viral vectors of the present invention; hence, these vectors are 
generally referred to herein as delivery plasmids. A third "type" of vector described 
herein is utilized to carry nucleotide sequences encoding therapeutic proteins or 
polypeptides to specific cells or cell types in a subject in need of treatment; these 
vectors are generally identified herein as therapeutic viral vectors or Ad-derived 
vectors and are in the form of a virus particle encapsulating a viral nucleic acid 
containing an expression cassette nucleic acid sequence for expressing the 
therapeutic gene. 



In various embodiments, a peptide-coding sequence of the therapeutic gene 
is inserted into an expression vector and expressed; however, it is also feasible to 
construct an expression vector which also includes some non-coding sequences as 
well. Preferably, however, non-coding sequences are excluded. Alternatively, a 
nucleotide sequence for a soluble form of a polypeptide may be utilized. Another 
preferred therapeutic viral vector includes a nucleotide sequence encoding at least 
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a portion of a therapeutic nucleotide sequence operatively linked to the expression 
vector for expression of the coding sequence in the therapeutic nucleotide 
sequence. 

As used herein with regard to DNA sequences or segments, the phrase 
"operatively linked" generally means the sequences or segments have been 
covalently joined into one piece of DNA, whether in single or double stranded form. 

The choice of viral vector into which a therapeutic nucleotide sequence of 
this invention is operatively linked depends directly, as is well known in the art. on 
the functional properties desired, e.g., vector replication and protein expression, 
and the host cell to be transformed - these being limitations inherent in the art of 
constructing recombinant DNA molecules. Although certain adenovirus serotypes 
are recited herein in the form of specific examples, it should be understood that the 
present invention contemplates the use of any adenovirus serotype,, including 
hybrids and derivatives thereof. As one will observe, it is not unusual or outside 
the scope of the present invention to utilize nucleotide and/or amino acid residue 
sequences of two or more serotypes in constructs, compositions and methods of 
the invention. 

A translatable nucleotide sequence is a linear series of nucleotides that 
provide an uninterrupted series of at least 8 codons that encode a polypeptide in 
one reading frame. Preferably, the nucleotide sequence is a DNA sequence. The 
vector itself may be of any suitable type, such as a viral vector (RNA or DNA), 
naked straight-chain or circular DNA, or a vesicle or envelope containing the 
nucleic acid material and any polypeptides that are to be inserted into the cell. 

2. Promoters 

As noted elsewhere herein, an expression nucleic acid in an Ad-derived 
vector of the present invention may also include a promoter sequence. 

In general, promoters are DNA segments that contain a DNA sequence that 
controls the expression of a gene located 3' or downstream of the promoter. The 
promoter is the DNA sequence to which RNA polymerase specifically binds and 
initiates RNA synthesis (transcription) of that gene, typically located 3' of the 
promoter A promoter also includes DNA sequences which direct the initiation of 
transcription, including those to which RNA polymerase specifically binds. If more 
than one nucleic acid sequence encoding a particular polypeptide or protein is 
included in a therapeutic viral vector or nucleotide sequence, more than one 
promoter or enhancer element may be included, particularly if that would enhance 
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efficiency of expression. For purposes of the present invention, regulatable 
(inducible) as well as constitutive promoters nnay be used, either on separate 
vectors or on the same vector. 

Both constitutive and regulatable (often called "inducible") promoters are 
useful in constructs and methods of the present invention. For example, some 
useful regulatable promoters are those of the CREB-regulated gene family and 
include inhibin, gonadotropin, cytochrome c, glucagon, and the like. (See, e.g., 
published International App. No. WO96/14061. 

A regulatable or inducible promoter may be described as a promoter 
wherein the rate of RNA polymerase binding and initiation is modulated by external 
stimuli. (See U.S. Patent Nos. 5,750,396 and 5,998,205). Such stimuli include 
various compounds or compositions, light, heat, stress, chemical energy sources, 
and the like. Inducible, suppressive and repressible promoters are considered 
regulatable promoters. 

Regulatable promoters may also include tissue-specific promoters. Tissue- 
specific promoters direct the expression of the gene to which they are operably 
linked to a specific cell type. Tissue-specific promoters cause the gene located 3* 
of it to be expressed predominantly, if not exclusively, in the specific cells where 
the promoter expressed its endogenous gene. Typically, it appears that if a 
tissue-specific promoter expresses the gene located 3' of it at all, then it is 
expressed appropriately in the correct cell types (see, e.g., Palmiter et aL, Ann, 
Rev. Genet 2a 465-499. 1986). 

When a tissue-specific promoter controls the expression of a gene, that 
gene will be expressed in a small number of tissues or cell types rather than in 
substantially all tissues and cell types. Examples of tissue-specific promoters 
include the immunoglobulin promoter described by Brinster et al., Nature 306: 
332-336 (1983) and Storb et aL, Nature 31 a 238-231 (1984); the elastase-l 
promoter described by Swift et al., Cell 38: 639-646 (1984); the globin promoter 
described by Townes et al., MoL Cell. BioL 5: 1977-1983 (1985), and Magram et 
al., MoL Cell, BioL 9: 4581-4584 (1989), the insulin promoter described by Bucchini 
etaL, PNAS USA, 83: 2511-2515 (1986) and Edwards et aL, Cell 58: 161 (1989); 
the immunoglobulin promoter described by Ruscon et aL, Nature 314: 330-334 
(1985) and Grosscheld et aL, Cell 38: 647-658 (1984); the alpha actin promoter 
described by Shani, MoL CelL BioL 6: 2624-2631 (1986); the alpha crystalline 
promoter described by Overbeek et al., PNAS USA 82: 7815-7819 (1985); the 
prolactin promoter described by Crenshaw et aL, Genes and Development 3: 
959-972 (1989); the propiomelanocortin promoter described by Tremblay et aL, 
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PNAS USA 85: 8890-8894 (1988); the beta-thyroid stimulating hormone (BTSH) 
promoter described by Tatsumi et aL, Nippon Rinsho 47: 2213-2220 (1989); the 
mouse mammary tumor virus (MMTV) promoter described by Muller et ah, Cell 54: 
105 (1988); the albumin promoter described by Palmiter et a/., Ann, Rev. Genet 
20: 465-499 (1986); the keratin promoter described by Vassar et aL, PNAS USA 
86\ 8565-8569 (1989); the osteonectin promoter described by McVey ef a/., J. BioL 
Chem. 263: 11,111-11,116 (1988); the prostate-specific promoter described by 
Allison etaL, MoL Cell. BioL 9: 2254-2257 (1989); the opsin promoter described by 
Nathans et ai, PNAS USA 81: 4851-4855 (1984); the olfactory marker protein 
promoter described by Danciger et aL, PNAS USA 86: 8565-8569 (1989); the 
neuron-specific enolase (NSE) promoter described by Forss-Pelter et aL, J. 
NeuroscL Res, 16: 141-151 (1986); the L-7 promoter described by Sutcliffe, Trends 
in Genetics 3: 73-76 (1987) and the protamine 1 promoter described Peschon et 
aL, Ann, New York Acad, ScL 564: 186-197 (1989) and Braun et aL, Genes and 
Developments-, 793-802 (1989). 



Although adenovirus consists of many proteins, not all adenovirus proteins 
are required for assembly of a recombinant adenovirus particle (vector) of this 
invention. Thus, deletion of the appropriate genes from a recombinant Ad vector as 
taught herein will thus allow the vector to accommodate even larger "foreign" DNA 
segments. Thus, if the sequences encoding one or more adenovirus polypeptides 
or proteins are supplanted by a recombinant nucleotide sequence of the present 
invention, the length of the recombinant sequence can conceivably extend nearly 
to the packaging limit of the relevant adenovirus-derived vector. 

In view of the fact that preferred embodiments disclosed herein are helper- 
independent Ad-derived vectors, the entire wild-type Ad genome cannot be 
completely supplanted by recombinant nucleic acid molecules without transforming 
such a vector into a vector requiring "help" of some kind. However, most of the Ad- 
derived vectors of the present invention do not depend on a helper virus; instead, 
the vectors of the present invention are propagated in cell lines stably expressing 
proteins or polypeptides that have been removed from said vectors to allow the 
addition of **foreign" DNA into the vectors. In various disclosed embodiments, 
specific early region and structural polypeptides are deleted from the vectors of the 
present invention, thereby enabling the vectors to accommodate recombinant 
nucleic acid sequences (or cassettes) of various lengths. For example, Ad-derived 
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vectors of the present invention may easily include 12 kb or more of foreign (or 
therapeutic") DNA sequences. 

Thus, adenovirus viral vectors are also disclosed which comprise nucleotide 
sequences encoding a packaging signal and a foreign protein or polypeptide, 
wherein the nucleotide sequence encoding an adenovirus structural protein has 
been deleted. 

in one variation, the nucleotide sequence encoding the foreign protein or 
polypeptide is a DNA molecule up to about 3 kb in length; in another, the 
nucleotide sequence encoding the foreign protein or polypeptide is a DNA 
molecule up to about 9.5 kb in length; in still another, the nucleotide sequence 
encoding the foreign protein or polypeptide is a DNA molecule up to about 12.5 kb 
in length. Nucleotide sequences of intermediate lengths are also contemplated by 
the present invention, as are sequences in excess of 1 2.5 kb. 

The invention also discloses viral vectors wherein the sequence encoding a 
foreign protein or polypeptide is a sequence encoding an anti-tumor agent, a tumor 
suppressor protein, a suicide protein, or a fragment or functional equivalent 
thereof. In one variation, nucleotide sequences encoding one or more regulatory 
proteins have also been deleted from the vector. In another, the regulatory 
proteins are selected from the group consisting of El A, E1B. E2A, E2B, E3, E4, 

and L4 (100K protein). 

A wide variety of therapeutic viral vectors are also embodiments of the 
present invention. In one embodiment, a therapeutic viral vector is disclosed which 
lacks a DNA sequence encoding fiber protein, or a portion thereof. In another 
variation, a therapeutic viral vector may further or altematively comprise deletion of 
a DNA sequence encoding one or more regulatory proteins, polypeptides, or 
fragments thereof. In various embodiments, foreign DNA sequences are inserted 
in place of the DNA sequence encoding fiber protein in the viral vectors of the 
present invention. In other embodiments, the therapeutic viral vectors further 
comprise foreign DNA sequences Inserted In place of the DNA sequences 
encoding one or more regulatory proteins, polypeptides, or fragments thereof, 
and/or one or more structural proteins, polypeptides, or fragments thereof. 

The present invention further discloses a number of viral vectors. In one 
variation, a viral vector comprises a deletion or mutation of a DNA sequence 
encoding an adenovirus structural protein, polypeptide, or fragment thereof. A 
vector may further comprise deletion or mutation of the DNA sequences encoding 
regulatory polypeptides El A and E1B; and it may still further comprise deletion or 
mutation of the DNA sequence encoding one or more of the following regulatory 
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proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. In another 
variation, in a viral vector of the present invention, the structural protein comprises 
fiber. Any combination of the foregoing is also contemplated by the present 
invention. The viral vectors of the present invention are suitable for the 
preparation of pharmaceutical compositions comprising any of the therapeutic viral 
vectors disclosed herein - including combinations thereof - are also disclosed 
herein. A further use of the viral vectors of the present invention is for targeting 
specific cells in a cell population comprising different cell types. 

Yet another variation discloses that a foreign DNA sequence encoding one 
or more foreign proteins, polypeptides or fragments thereof has been inserted in 
place of any of the deletions in the therapeutic viral vector. In one embodiment, 
the foreign DNA encodes a tumor-suppressor protein or a biologically active 
fragment thereof. In another embodiment, the foreign DNA encodes a suicide 
protein or a biologically active fragment thereof. 

The invention further contemplates that a viral vector comprises a foreign 
DNA sequence encoding one or more foreign proteins, polypeptides or fragments 
thereof wherein said DNA sequence has been inserted in place of any structural 
and/or regulatory proteins (or portions thereof) that have been deleted. Thus, in 
one embodiment, the foreign DNA encodes a therapeutic molecule such as a 
tumor-suppressor protein; a suicide protein; a cystic fibrosis transmembrane 
conductance regulator (CFTR) protein; or a biologically active fragment of any of 
them. 

The therapeutic (or foreign) nucleotide sequence can be a gene or gene 
fragment that encodes a protein or polypeptide -- or a biologically active fragment 
thereof. (See, e.g.. Crystal, et ai, Nature Genetics 8: (1994); Zabner, ef a/.. 

Cell 75: 207-216 (1993); Knowles, et ai. NEJM 333(13): 823-831 (1995); and 
Rosenfeld, etal.. Cell 68: 143-155 (1992). 

It is further contemplated that a therapeutic protein or polypeptide 
expressed by a therapeutic viral vector of the present invention may be used in 
conjunction with another therapeutic agent when appropriate - e.g., a thymidine 
kinase metabolite may be used In conjunction with the gene encoding thymidine 
kinase and its gene product -- in order to be even more effective. 

Alternatively, a therapeutic viral vector can include a DNA or RNA 
oligonucleotide sequence that exhibits therapeutic activity without needing to be 
translated into a polypeptide product before exerting a therapeutic effect. 
Examples of the latter include antisense oligonucleotides that will inhibit the 
transcription of deleterious genes or ribozymes that act as site-specific 
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ribonucleases for cleaving selected mutated gene sequences. In another variation, 
a therapeutic nucleotide sequence of the present invention may comprise a DNA 
construct capable of generating therapeutic nucleotide molecules, including 
ribozymes and antisense DNA, in high copy numbers in target cells, as described 
in published PCT application No. WO 92/06693 (the disclosure of which is 
incorporated herein by reference). Other preferred therapeutic nucleotide 
sequences according to the present invention are capable of delivering HIV 
antisense oligonucleotides to latently-infected T cells via CD4. Similarly, delivery 
of Epstein-Barr Virus (EBV) EBNa-1 antisense oligonucleotides to B cells via CR2 
is capable of effecting therapeutic results. 

A preferred recombinant adenovirus vector genome is based on the vector 
described in the Examples and designated Ad5.pgal.AF. This vector is a helper 
independent, fiberless vector genome which can host, upon insertion, an 
exogenous gene for expression of an exogenous or therapeutic protein. The 
genome of Ad5.pgal.AF has a nucleotide sequence shown in SEQ ID NO: 27. A 
virus particle containing Ad5.pgat.AF vector genome has been prepared as 
described in the Examples and is deposited with the ATCC as Accession No. VR- 
2636 

The Ad5.pgal.AF genome nucleic acid can be manipulated to contain any 
exogenous gene in place of the beta-galactosidase gene present in the construct, 
as described herein. 

Construction of Therapeutic Viral Vectors for Gene Delivery 
A. Adenovirus Particles 

Various methods of making and using the vectors, plasmids, cell lines and 
other compositions and constructs of the present invention are also disclosed 
herein. The following methods are considered exemplary and not limiting. 

Thus, in one variation, the invention discloses a method of constructing 
therapeutic viral vectors, comprising introducing a delivery plasmid into an Ad fiber- 
expressing complementing cell line, wherein the DNA sequence encoding Ad fiber 
protein has been deleted from the delivery plasmid. In one variation, the delivery 
plasmid further includes a DNA sequence encoding a foreign protein, polypeptide, 
or fragment thereof. In other embodiments, a combination of pDV44 and 
pAEIBpgal or a similar construct such as, for example, that found in pDV44, 
pAEIBpgal or the equivalent. 
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A recombinant adenovirus particle may be produced with a fiber protein, or it 
may be produced without a fiber protein ("fiberless particle") according to the 
present invention. Where the particle is made without fiber, such as by passaging 
the fiberless viral vector genome, e.g., Ad5.pgal.AF in the 293 cells, a fiberless 
genome is replicated and packaged in a fiberless particle. In contrast, where the 
fiberless genome Ad5.pgal.AF is passaged in the 211B or other fiber expressing 
cells, a fiberless genome is replicated and packaged into a fiber-containing 
particle. 

Recombinant adenovirus particles may be made such that they include no 
fiber proteins, modified fiber proteins or other exogenous proteins. They may also 
be produced in systems using either helper-independent or helper-dependent 
adenovirus recombinant genomes, i.e. with or without helper viruses. 

B. Targeting of Particles to Tissues - Virus Trophism 

A preferred viral vector particle in which therapeutic nucleotide compositions 
of this invention are present is derived from adenovirus (Ad). As taught herein, 
viral vector particles of this invention may be designed and constructed in such a 
way that they specifically target a preselected recipient cell type, depending on the 
nature of therapy one seeks to administer. Methods of making and using 
therapeutic viral vectors that target specific cells are further described in the 
Examples that follow. 

Novel vectors, viral particles or compositions may also be designed and 
prepared to preferentially target cells that might not othenwise be targeted by wild- 
type adenovirus virions. For example, in order to target non-epithelial cells, one 
following the teachings of the present . specification may be able to prepare a 
therapeutic vector particle including a nucleotide sequence encoding a foreign 
protein, polypeptide or other ligand directed to a non-epithelial cell or to a different 
receptor than that generally targeted by a particular adenovirus. Examples of 
useful ligands directed to specific receptors (identified in parentheses) include the 
V3 loop of HIV gp120 (CD4); transferring (transferrin receptor); LDL, apollpoprotein 
B100. apollpoprotein E (LDL receptors): and deglycosylated proteins 
(asialoglycoprotein receptor). Various useful ligands which may be added to 
adenovirus fiber ~ and methods for preparing and attaching same - are set forth in 
U.S. Patent Nos. 5,756,086 and 5,543,328. 

In yet another embodiment, the non-native amino acid residue sequence is 
incorporated into the fiber amino acid residue sequence at a location other than 
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one of the fiber termini. Alternatively, the non-native amino acid residue sequence 
alters the binding specificity of the fiber for a targeted cell type. In other 
embodiments, the linker sequence alters the binding specificity of the fiber for a 
targeted cell type. The expressed fiber may, in various embodiments, bind to a 
specific targeted cell type not usually targeted by adenovirus and/or may comprise 
amino acid residue sequences from more than one adenovirus serotype. 

Useful ligands may be encoded by a foreign nucleotide sequence contained 
within a viral vector of the present invention, or may be linked to proteins or 
polypeptides, include antibodies and attachment sequences, as well as receptors 
themselves. For example, antibodies to cell receptor molecules such as integrins 
and the like, MHC Class I and Class II, asialoglycoprotein receptor, transferrin 
receptors, LDL receptors, CD4, and CR2 are but a few which are useful according 
to the present invention. It is also understood that the ligands typically bound by 
receptors, as well as analogs to those ligands, may be used as cellular targeting 
agents, as disclosed herein. 

Therapeutic Methods 

The recombinant adenovirus vectors of the present invention, typically in the 
form of an adenovirus particle encapsulating a recombinant adenovirus vector 
genome containing an expression cassette for expressing a therapeutic gene, are 
particularly suited for gene therapy. Thus, various therapeutic methods are 
contemplated by the present invention. 

For example, it has now been discovered that Ad-derived viral vectors are 
capable of delivering a therapeutic nucleotide sequence to a specific cell or tissue, 
based on the tissue tropism of the particle, thereby expanding and enhancing 
treatment options available in numerous conditions in which more conventional 
therapies are of limited efficacy. Accordingly, methods of gene therapy utilizing a 
recombinant adenovirus particle containing a modified fiber or chimeric fiber which 
targets a preselected tissue, as described herein, is within the scope of the 
invention. Vector particles are typically purified and then an effective amount is 
administered in vivo or ex vivo {in wYro)into the subject. 

For in vitro or ex vivo gene transfer, administration is often accomplished by 
first isolating a selected cell population from a patient such as lung epithelial cells, 
lymphocytes and the like followed by in vitro or ex vivo gene transfer of the 
therapeutic compositions of this invention and the replacement of the cells into the 
patient. In vivo therapy is also contemplated, e.g., via the administration of 
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therapeutic compositions of this invention by various delivery means. For example, 
aerosol administration and administration via subcutaneous, intravenous, 
intraperitoneal, intramuscular, ocular means and the like are also within the scope 
of the present invention. 

Other gene-delivery methods are also useful in conjunction with the 
methods, compositions and constructs of the present invention; see, e,g., 
published International Application No. WO 95/11984. 

The present invention also contemplates various methods of targeting 
specific cells - e.g., cells in a subject in need of diagnosis and/or treatment. As 
discussed herein, the present invention contemplates that the viral vectors and 
compositions of the present invention may be directed to specific receptors or cells, 
for the ultimate purpose of delivering those vectors and compositions to specific 
cells or cell types. The viral vectors and constructs of the present invention are 
particularly useful in this regard. 

In general, adenovirus attachment and uptake into ceils are separate but 
cooperative events that result from the interaction of distinct viral coat proteins with 
a receptor for attachment and av integrin receptors for internalization. Adenovirus 
attachment to the cell surface via the fiber coat proteins has been discovered to be 
dissociable and distinct from the subsequent step of internalization, and the 
present invention is able to take advantage of and function in conjunction with 
these differing receptors. 

The invention also discloses methods of transforming a pathologic 
hyperproliferative mammalian cell comprising contacting the cell with any of the 
vectors described herein. In another embodiment, methods of infecting a 
mammalian target cell with a viral vector containing a preselected foreign 
nucleotide sequence are disclosed. One such variation comprises the following 
steps: (a) infecting the target cell with a viral vector of the present invention, the 
viral vector carrying a preselected foreign nucleotide sequence; and (b) expressing 
the foreign nucleotide sequence in the targeted celL 

jhe invention also encompasses mammalian target cells infected with a 
preselected foreign nucleotide sequence produced by the methods disclosed 
herein. In one variation, the target cells are selected from the group consisting of 
replicating, slow-replicating and non-replicating human cells. 

Methods of treating an acquired or hereditary disease are also disclosed. 
One method comprises (a) administering a pharmaceutically acceptable dose of a 
viral vector to a target cell, wherein the vector comprises a preselected therapeutic 
nucleotide sequence; and (b) expressing the therapeutic sequence in the target 
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cell for a time period sufficjent to ameliorate the acquired or hereditary disease in 
the cell. Method of gene therapy comprising administering to a subject an effective 
amount of a therapeutic viral vector produced by a packaging ceil line of the 
present invention are also disclosed. 

Also contemplated by the present invention are various methods of inhibiting 
the proliferation of a tumor in a subject comprising administering an effective 
amount of a therapeutic viral vector of the present invention under suitable 
conditions to the subject. In one variation, the gene encodes an anti-tumor agent. 
In another variation, the agent is a tumor-suppressor gene. In still another 
embodiment, the agent is a suicide gene or a functional equivalent thereof. In 
another variation, the vector is administered via intra-tumoral injection. 

A composition of this invention may be used prophylactically or 
therapeutically in vivo to disrupt HIV infection and mechanisms of action by 
inhibiting gene expression or activation, via delivery of antisense HIV sequences or 
ribozymes to T cells or monocytes. Using methods of the present invention, one 
may target therapeutic viral vectors as disclosed herein to specific cells and 
tissues, including hematopoietic cells, as Infection of such cells appears to be 
mediated by distinct integrins to which viral vectors of the present invention may 
readily be targeted. (See, e.g., Huang, ef a/., J. ViroL 70: 4502-8, 1996). 

Other useful therapeutic nucleotide sequences include antisense nucleotide 
sequences complementary to EBV EBNa-1 gene. Use of such therapeutic 
sequences may remediate or prevent latent infection of B cells with EBV. As 
discussed herein and in the Examples below, targeting and delivery may be 
accomplished via the use of various ligands, receptors, and other appropriate 
targeting agents. 

Thus, in one embodiment, a therapeutic method of the present invention 
comprises contacting the cells of a subject infected with EBV or HIV with a 
therapeutically effective amount of a pharmaceutically acceptable composition 
comprising a therapeutic nucleotide sequence of this invention. In a related 
embodiment, the contacting involves introducing the therapeutic nucleotide 
sequence composition into cells having an EBV or HIV-mediated infection. 

Methods of gene therapy are well known in the art (see, e,g,, Larrick and 
Burck, Gene Therapy: Application of Molecular Biology, Elsevier Science Publ. 
Co., Inc., New York, NY (1991); Kriegler, Gene Transfer and Expression: A 
Laboratory Manual, W. H. Freeman and Company, New York, 1990). The term 
"subjecr should be understood to include any animal - particularly mammalian — 
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patient, such as any murine, rat, bovine, porcine, canine, feline, equine, ursine, or 
human patient. 

When the foreign gene carried in the vector encodes a tumor suppressor 
gene or another anti-tumor protein, the vector is useful to treat or reduce 
hyperproliferative cells in a subject, to inhibit tumor proliferation in a subject or to 
ameliorate a particular, related pathology. 

The present invention also contemplates methods of depleting suitable 
samples of pathologic mammalian hyperproliferative cells contaminating 
hematopoietic precursors during bone marrow reconstitution via the introduction of 
a wild-type tumor suppressor gene into the cell preparation using a vector of this 
invention. As used herein, a suitable sample is defined as a heterogeneous cell 
preparation obtained from a patient, e.g., a mixed population of cells containing 
both phenotypically normal and pathogenic cells. 

Administration includes - but is not limited to ~ the introduction of 
therapeutic agents of the present invention into a cell or subject via various means, 
including direct injection, intravenously, intraperitoneally, via intra-tumor injection, 
via aerosols, or topically. Therapeutic agents as disclosed herein may also be 
combined for administration of an effective amount of the agents with a 
pharmaceutically-acceptable carrier, as described herein. 

As used herein, "effective amount" generally means the amount of vector 
particle (or proteins produced/released thereby) which achieves a positive outcome 
in the subject to whom the vector is administered. The total volume administered 
will necessarily vary depending on the mode of administration, as those of skill in 
the relevant art will appreciate, and dosages may vary as well. 

The dose of a biologic vector (particle) is somewhat complex and may be 
described in terms of the concentration (in plaque-forming units per milliliter 
(pfu/ml)), the total dose (In pfus), or the estimated number of particles administered 
per cell (the estimated multiplicity of infection or MOI). Thus, if a vector is 
administered via infusion - say, across nasal epithelium - at a constant total 
volume, the respective concentration, etc. may be described as follows: 
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In general, when recombinant adenoviral vector particles are administered 
via infusion across the nasal epithelium (e.g. an area of nasal epithelium 
containing 2 x 10^ cells.) administered amounts producing an estimated MOI 
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dosages. (See, e.g., Knowles, etaL, New Eng. J. Med. 333:823-831, 1995). 
Similarly, when direct injection is the preferred treatment modality — e.g,, direct 
injection of a viral vector into a tumor doses of 1 x 1 0^ pfu or greater are 
generally preferred. (See, e.g., published International App. No. W095/11984.) 

Thus, depending on the mode of administration, an effective amount 
administered in a single dose preferably contains from about 10^ to about ^0^^ 
infectious units. A typical course of treatment would be one such dose per day 
over a period of five days. As those of skill in the art will appreciate, an effective 
amount may vary depending on (1) the pathology or other condition to be treated, 
(2) the status and sensitivity of the patient, and (3) various other factors well known 
to those of skill in the art, such as the patient's tolerance to other courses of 
treatment that may have been applied previously. Thus, those of skill in the art 
may easily and precisely determine effective amounts of the agents/vectors of the 
present invention which may be administered to a particular patient, based on their 
understanding of and evaluation of such factors. 

The present invention also contemplates methods of ameliorating 
pathologies characterized by hyperproliferative cells or genetic defects in a subject, 
by administering to the subject an effective amount of a vector as described herein. 
Such vectors preferably contain a foreign gene encoding a gene product (e.g. 
polypeptide or protein) having the ability to ameliorate the pathology, under 
suitable conditions. As used herein, the term "genetic defect" means any disease. 
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condition or abnormality wliicli results from inherited factors, e.g. Huntington's 
Disease, Tay-Sachs Disease, or Sickle Cell Disease. 

The present invention further provides methods for reducing the proliferation 
of tumor cells in a subject by introducing into the tumor mass an effective amount of 
an adenoviral expression vector containing an anti-tumor gene other than a tumor 
suppressor gene. The anti-tumor gene can encode, for example, thymidine kinase 
(TK). An effective amount of a therapeutic agent is then administered to the 
subject; the therapeutic agent, in the presence of the anti-tumor gene, is toxic to 
the cell. 

Using thymidine kinase as exemplary, the therapeutic agent is a thymidine 
kinase metabolite such as ganciclovir (GCV), 6-methoxypurine arablnonucleoside 
(araM), or a functional equivalent thereof. Both the thymidine kinase gene and the 
thymidine kinase substrate must be used concurrently in order to exert a toxic 
effect on the host cell. In the presence of the TK gene. GCV is phosphorylated and 
becomes a potent inhibitor of DNA synthesis, whereas araM is converted to the 
cytotoxic anabolite araATP. Thus, the precise method of action or synergism is not 
relevant to therapeutic efficacy; what Is relevant is the fact that the concurrent use 
of appropriate genes and therapeutic agents may effectively ameliorate a specific 
disease condition. 

Another useful example contemplates use of a vector of the present 
invention which expresses the enzyme cytosine deaminase. Such a vector could 
be used In conjunction with administration of the drug 5-fluorouracil (Austin and 
Huber, MoL Pharm. 43: 380-387, 1993) or the recently-described E. coli Deo gene 
in combination with 6-methyl-purine-2'-deoxyribonucleoside (Sorscher etal., Gene 
Therapy 233-238, 1994). 

As with the use of the tumor suppressor genes described previously, the use 
of other anti-tumor genes, either alone or in combination with the appropriate 
therapeutic agent, provides a treatment for the uncontrolled cell growth or 
proliferation characteristic of tumors and malignancies. Thus, the present invention 
provides therapies to halt the uncontrolled cellular growth in a patient, thereby 
alleviating the symptoms or the disease or cachexia present In the patient. The 
effect of this treatment includes, but Is not limited to, prolonged survival time of the 
patient, reduction in tumor mass or burden, apoptosis of tumor cells, or the 
reduction in the number of circulating tumor cells. IVleans of quantifying the 
beneficial effects of this therapy are well known to those of skill In the art. 

The present Invention provides a recombinant adenovirus expression vector 
characterized by the partial or total deletion of one or more adenoviral structural 
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protein genes, such as the gene encoding fiber, which allows the vector to 
accommodate a therapeutic, foreign nucleic acid sequence encoding a functional 
foreign polypeptide, protein, or biologically active fragment thereof. A therapeutic 
gene sequence may be introduced into a tumor mass by combining the adenoviral 
expression vector with a suitable pharmaceutically acceptable carrier. Introduction 
can be accomplished, for example, via direct injection of the recombinant Ad vector 
into the tumor mass. 

A method of tumor-specific delivery of a tumor-suppressor gene is 
accomplished by contacting target tissue in a subject with an effective amount of a 
recombinant Ad-derived vector of this invention. In the case of anti-tumor therapy, 
the gene is intended to encode an anti-tumor agent, such as a functional tumor 
suppressor gene product or suicide gene product. The temri "contacting" is 
intended to encompass any delivery method for the efficient transfer of the vector, 
such as via intra-tumoral injection. 

In another example, adenovirus vectors of the present invention can be used 
to transfer genes to central nervous system (CNS) tumors in vivo. 

The present invention also contemplates methods for determining the 
efficacy of the within-disclosed therapeutic compositions and methods. One such 
method for confirming efficacy utilizes the human/SCID (severe combined 
immunodeficient) mouse model of EBV-induced LPD (lymphoproliferative disease) 
to ascertain whether EBV-antisense therapeutic nucleotide sequences block tumor 
formation. (See, e.g., Pisa, etaL, Blood 79: 173-179 (1992); Rowe, etaL, Curr. 
Top, Microbiol. Immunol. 166: 325 (1990); and Cannon, etaL, J^CIin. Invest 85: 
1333-1337(1990), 

Finally, the use of Ad vectors of the present invention to prepare 
medicaments for the treatment, therapy and/or diagnosis of various diseases is 
also contemplated by this invention. Moreover, other anti-tumor genes may be 
used in combination with the corresponding therapeutic agent to reduce the 
proliferation of tumor cells. Such other gene-and-therapeutic-agent combinations 
are known to those of skill in the art and may be applied as taught herein. 

A. Therapeutic Composftions 

In various alternative embodiments of the present invention, therapeutic 
sequences and compositions useful for practicing the therapeutic methods 
described herein are contemplated. Therapeutic compositions of the present 
invention may contain a physiologically tolerable carrier together with one or more 
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therapeutic nucleotide sequences of this invention, dissolved or dispersed therein 
as an active ingredient. In a preferred embodiment, the composition is not 
immunogenic or otherwise able to cause undesirable side effects when 
administered to a subject for therapeutic purposes. 

As used herein, the terms "pharmaceutically acceptable," "physiologically 
tolerable" and grammatical variations thereof, as they refer to compositions, 
carriers, diluents and reagents, are used interchangeably and represent that the 
materials are capable of administration to or upon a subject - e.g., a mammal - 
without the production of undesirable physiological effects such as nausea, 
dizziness, gastric upset and the like. 

For example, the present invention comprises therapeutic compositions 
useful in the specific targeting of epithelial or non-epithelial cells as well as in 
delivering a therapeutic nucleotide sequence to those cells. Therapeutic 
compositions designed to preferentially target to epithelial cells may comprise a 
recombinant adenovirus-derived vector particle including a therapeutic nucleotide 
sequence. As described herein, a number of adenovirus-derived moieties are 
described, including particles lacking fiber, particles that contain wild type 
adenovirus fiber, and particles that contain modified or chimeric fiber, each type 
providing a different tissue tropism to the particle. 

The preparation of a pharmacological composition that contains active 
ingredients dissolved or dispersed therein is well understood in the art. Typically 
such compositions are prepared as injectables either as liquid solutions or 
suspensions — however, solid forms suitable for solution or suspension in liquid 
prior to use can also be prepared. A preparation can also be emulsified, or 
formulated into suppositories, ointments, creams, dermal patches, or the like, 
depending on the desired route of administration. 

Physiologically tolerable carriers are well known in the art. Exemplary of 
liquid carriers are sterile aqueous solutions that contain no materials in addition to 
the active ingredients and water, or contain a buffer such as sodium phosphate at 
physiological pH value, physiological saline or both, such as phosphate-buffered 
saline. Still further, aqueous carriers can contain more than one buffer salt, as well 
as salts such as sodium and potassium chlorides, dextrose, polyethylene glycol 
and other solutes. 

Liquid compositions can also contain liquid phases in addition to and to the 
exclusion of water. Exemplary of such additional liquid phases are glycerin, 
vegetable oils such as cottonseed oil, and water-oil emulsions. 
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A therapeutic composition typically contains an amount of a therapeutic 
material, i.e., a nucleotide sequence or adenovirus vector particle of the present 
invention, sufficient to deliver a therapeutically effective amount to the target 
tissue, typically an amount of at least 0.1 weight percent to about 90 weight percent 
of therapeutic material per weight of total therapeutic composition. A weight 
percent is a ratio by weight of therapeutic material, e.g., a nucleotide sequence, to 
total composition. Thus, for example, 0.1 weight percent is 0.1 grams of DNA 
segment per 1 00 grams of total composition. 

Other Applications 

The cell lines, viral vectors and methods of the present invention may also 
be used for purposes other than the direct administration of therapeutic nucleotide 
sequences. In one such application, the production of large quantities of 
biologically active proteins or polypeptides in cells transfected with the within- 
disclosed viral vectors is contemplated herein. For example, human 
lymphoblastoid cells may be transfected with a viral vector of the present invention 
carrying a human hematopoietic growth factor such as the gene for erythropoietin 
(EPO); cells so transfected are thus able to produce biologically active EPO. (See, 
e.g., Lopez etaL, Gene 148: 285-91, 1994). 

Various other applications and uses of the within-described methods, cell 
lines, plasmids, vectors, and compositions of the present invention shall become 
apparent upon closer examination of the Examples that follow. 
The following examples are intended to illustrate, but not limit, the present 
invention. As such, the following description provides details of the manner in 
which particular embodiments of the present invention may be made and used. 
This description, while exemplary of the present invention, is not to be construed 
as specifically limiting the invention. Variations and equivalents, now known or 
later developed, which would be within the understanding and technical 
competence of one skilled in this art are to be considered as falling within the 
scope of this invention. 

Example 1 

Preparation of Adenovirus Packaging Cell Lines 
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Cell lines that are commonly used for growing adenovirus are useful as host 
cells for the preparation of adenovirus packaging cell lines. Preferred cells include 
293 cells, an adenovirus-transformed human embryonic kidney cell line obtained 
from the ATCC, having Accession Number CRL 1573; HeLa, a human epithelial 
carcinoma cell line (ATCC Accession Number CCL-2); A549. a human lung 
carcinoma cell line (ATCC Accession Number CCL 1889); and the like epithelial- 
derived cell lines. As a result of the adenovirus transformation, the 293 cells 
contain the El early region regulatory gene. All cells were maintained in complete 
DMEM + 10% fetal calf serum unless otherwise noted. 

The cell lines of this invention allow for the production and propagation of 
novel adenovirus-based gene delivery vectors having deletions in preselected 
gene regions, that are obtained by cellular complementation of adenoviral genes. 
To provide the desired complementation of such deleted adenoviral genomes in 
order to generate a novel viral vector of the present invention, plasmid vectors that 
contain preselected functional units were designed as described herein. Such 
units include but are not limited to El early region, E4 and the viral fiber gene. The 
preparation of plasmids providing such complementation, thereby being 
"complementary plasmids or constructs," that are stably inserted into host cell 
chromosomes are described below. 

A. Preparation of an E4-Expressmg Plasmid for Complementation 



The viral E4 regulatory region contains a single transcription unit which is 
alternately spliced to produce several different mRNAs. The E4-expressing 
plasmid prepared as described herein and used to transfect the 293 cell line 
contains the entire E4 transcriptional unit as shown in Figure 1. A DNA fragment 
extending from 175 nucleotides upstream of the E4 transcriptional start site 
including the natural E4 promoter to 153 nucleotides downstream of the E4 
polyadenylation signal including the natural E4 terminator signal, corresponding to 
nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as Ad5) 
genome as described in Chroboczek et ai {Virol., 186:280-285 (1992), GenBank 
Accession Number M73260), was amplified from Ad5 genomic DNA, obtained from 
the ATCC, via the polymerase chain reaction (PCR). Sequences of the primers 
used were 5'CGGTACAC AGAATTC AGGAGACACAACTCC3' (forward or 5' primer 
referred to as E4L) (SEQ ID NO: 1) and 



5'GCC TGGATCC GGGAAGTTACGTAACGTGGGAAAAC3' (SEQ ID NO: 2) 



of E4-Gene-Deleted Adenoviruses 
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(backward or 3' primer referred to as E4R). To facilitate cloning of the PGR 
fragment, these oligonucleotides were designed to create novel sites for the 
restriction enzymes EcoRI and BamHI, respectively, as indicated with underlined 
nucleotides. DNA was amplified via PGR using 30 cycles of 92 G for 1 minute, 50 
C for 1 minute, and 72 C for 3 minutes resulting in amplified full-length E4 gene 
products. 

The amplified DNA E4 products were then digested with EcoRI and BamHI 
for cloning into the compatible sites of pBluescript/SK+ by standard techniques to 
create the plasmid pBS/E4. A 2603 base pair (bp) cassette including the herpes 
simplex virus thymidine kinase promoter, the hygromycin resistance gene, and the 
thymidine kinase polyadenylation signal was excised from the plasmid pMEP4 
(Invitrogen, San Diego, CA) by digestion with Fspl followed by addition of BamHI 
linkers (5'CGGGGATCGGCG3') (SEQ ID NO: 3) for subsequent digestion with 
BamHI to isolate the hygromycin-containing fragment. The isolated BamHI- 

modified fragment was then cloned into the BamHI site of pBS/E4 containing the 
E4 region to create the plasmid pE4/Hygro containing 8710 bp (Figure 2). The 
pE4/Hygro plasmid has been deposited with the ATCC as described in Example 3. 
The complete nucleotide sequence of pE4/Hygro is listed in SEQ ID NO: 4. 
Position number 1 of the linearized vector corresponds to approximately the middle 
portion of the pBS/SK+ backbone as shown in Figure 2 as a thin line between the 
3' BamHI site in the hygromycin insert and the 3' EcoRI site in the E4 insert. The 5' 
and 3' ends of the E4 gene are located at respective nucleotide positions 3820 and 
707 of SEQ ID NO: 4 while the 5' and 3" ends of the hygromycin insert are located 
at respective nucleotide positions 3830 and 6470. In the clone that was selected 
for use, the E4 and hygromycin resistance genes were divergently transcribed. 



wo 00/42208 



PCT/EPOO/00265 



-51- 

B. Preparation of a Fiber-Expressing Plasmid for 
Complementation of Fiber-Gene-Deleted Adenoviruses 

To prepare a fiber-encoding construct, primers were designed to amplify the 
fiber coding region from Ad5 genomic DNA with the addition of unique BamHI and 
Not! sites at the 5' and 3' ends of the fragment, respectively. The Ad5 nucleotide 
sequence is available with the GenBank Accession Number M18369. The 5' and 3' 
primers had the respective nucleotide sequences of 
S'ATG GGATCC AAGATGAAGCGCGCAAGACCGS' (SEQ ID NO: 5) and 
5»CATAACGCGGCCGCTTCTTTATTCTTGGGC3' (SEQ ID NO: 6), where the 
inserted BamHI and NotI sites are indicated by underlining. The 5' primer also 
contained a nucleotide substitution 3 nucleotides 5* of the second ATG codon (C to 
A) that is the initiation site. The nucleotide substitution was included so as to 
improve the consensus for initiation of fiber protein translation. 

The amplified DNA fragment was inserted into the BamHI and NotI sites of 
pcDNA3 (Invitrogen) to create the plasmid designated pCDNA3/Fiber having 7148 
bp, the plasmid map of which is shown in Figure 3. The parent plasmid contained 
the CMV promoter, the bovine growth hormone (BHG) terminator and the gene for 
conferring neomycin resistance. The viral sequence included in this construct 
corresponds to nucleotides 31040-32791 of the Ad5 genome. 

The complete nucleotide sequence of pCDNA3/Fiber is listed in SEQ ID NO: 
7 where the nucleotide position 1 corresponds to approximately the middle of the 
pcDNA3 vector sequence. The 5* and 3' ends of the fiber gene are located at 
respective nucleotide positions 91 6 . with ATG and 2661 with TAA. 

To enhance expression of fiber protein by the constitutive CMV promoter 
provided by the pcDNA vector, a Bglll fragment containing the tripartite leader 
(TPL) of adenovirus type 5 was excised from pRD112a (Sheay et al, 
BioTechniques, 15:856-862 (1993) and inserted into the BamHI site of 
pCDNA3/Fiber to create the plasmid pCLF having 7469 bp, the plasmid map of 
which is shown in Figure 4. The adenovirus tripartite leader sequence, present at 
the 5' end of all major late adenoviral mRNAs as described by Logan et aL, Proc. 
Natl. Acad. ScL, USA, 81:3655-3659 (1984) and Berkner, BioTechniques, 5:616- 
629 (1988), also referred to as a "partial TPL" since it contains a partial exon 1, 
shows correspondence with the AdS leader sequence having three spatially 
separated exons corresponding to nucleotide positions 6081-6089 (the 3' end of 
the first leader segment), 7111-7182 (the entire second leader segment), and 
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9644-9845 (the third leader segment and sequence downstream of that segment). 
The corresponding cDNA sequence of the partial tripartite leader sequence present 
in pCLF is listed in SEQ ID NO: 8 bordered by BamHI/Bglll 5' and 3' sites at 
respective nucleotide positions 907-912 to 1228-1233. The nucleotide sequence 
of an isolated partial TPL of the present invention is also listed separately as SEQ 
ID NO: 26 with the noted 5' and 3' restriction sites and with the following nucleotide 
regions identified: 1-6 nt Bglll site; 1-18 nt poiylinker. 19-27 nt last 9 nt of the first 
leader segment (exon 1); 28-99 nt second leader segment (exon 2); 100-187 nt 
third leader segment (exon 3); 188-301 nt contains the nt sequence immediately 
following the third leader in the genome with an unknown function; and 322-327 nt 
Bglll site. 

The pCLF plasmid has been deposited with the ATCC as described in 
Example 3. The complete nucleotide sequence of pCLF is listed in SEQ ID NO: 8 
where the nucleotide position 1 corresponds to approximately the middle of the 
pcDNA3 parent vector sequence. The 5' and 3 ends of the Ad5 fiber gene are 
located at respective nucleotide positions 1237-1239 with ATG and 2980-2982 with 
TAA The rest of the vector construct has been previously described above. 

C. Generation of an Adenovirus Packaging Cell Lme Carrying 

Plasmids Encoding Functional E4 and Fiber Proteins 



The 293 cell line was selected for preparing the first adenovirus packaging 
line as it already contains the E1 gene as prepared by Graham et al., J. Gen. Virol., 
36:59-74 (1977) and as further characterized by Spector, Virol.. 73a533-538 
(1983). Before electroporation, 293 cells were grown in RPMI medium + 10% fetal 
calf semm. Four x IqS cells were electroporated with 20 pg each of pE4/Hygro 
DNA and pCLF DNA using a BloRad GenePulser and settings of 300 V. 25 pF. 
DNA for electroporation was prepared using the Qiagen system according to the 
manufacturer's instructions (Bio-Rad, Richmond, OA). 

Following electroporation. cells were split into fresh complete DMEM + 10% 
fetal calf serum containing 200 pg/ml Hygromycin B (Sigma, St. Louis, MO). 

From expanded colonies, genomic DNA was isolated using the 
"MICROTURBOGEN" system (Invitrogen) according to manufacturer's instructions. 
The presence of integrated E4 DNA was assessed by PGR using the primer pair 
E4R and ORF6L (5'TGCTTAAGCGGCCGCGAAGGAGA AGTCC3') (SEQ ID NO: 
9), the latter of which Is a 5' fonward primer near adenovirus 5 open reading frame 
6. Refer to Figure 1 for position of the primers relative to the E4 genes. 
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One clone, designated 211, was selected exhibiting altered growth 
properties relative to that seen in parent cell line 293. The 211 clone contained the 
expected product, indicating the presence of inserted DNA corresponding to most, 
if not all, of the E4 fragment contained in the pE4/Hygro plasmid. The 211 cell line 
has been deposited with the ATCC as described in Example 3. This line was 
further evaluated by amplification using the primer pair E4LyE4R described above, 
and a product corresponding to the full-length E4 insert was detected. Genomic 
Southern blotting was performed on DNA restricted with EcoRI and BamHI. The E4 
fragment was then detected at approximately one copy/genome compared to 
standards with the EcoRI/BamHI E4 fragment as cloned into pBS/E4 for use as a 
labeled probe with the Genius system according to manufacturer's instructions 
(Boehringer Mannheim, Indianapolis, IN), In DNA from the 211 cell line, the 
expected labeled internal fragment pE4/Hygro hybridized with the isolated E4 
sequences. In addition, the probe hybridized to a larger fragment which may be 
the result of a second insertion event (Figure 5). 

Although the 211 cell line was not selected by neomycin resistance, thus 
indicating the absence of fiber gene, to confirm the lack of fiber gene, the 211 cell 
line was analyzed for expression of fiber protein by indirect immunofluorescence 
with an anti-fiber polyclonal antibody and a FITC-labeled anti-rabbit IgG (KPL) as 
secondary. No immunoreactivity was detected. Therefore, to generate 211 clones 
containing recombinant fiber genes, the 21 1 clone was expanded by growing in 
RPMI medium and subjected to additional electroporation with the fiber-encoding 
pCLF plasmid as described above. 

Following electroporation, cells were plated in DN/IEM + 10% fetal calf serum 
and colonies were selected with 200 pg/ml G418 (Gibco, Gaithersburg, MD). 
Positive cell lines remained hygromycin resistant. These candidate sublines of 21 1 
were then screened for fiber protein expression by indirect immunofluorescence as 
described above. The three sublines screened, 21 1 A, 21 IB and 21 1 R, along with 
a number of other sublines, all exhibited nuclear staining qualitatively comparable 
to the positive control of 293 cells infected with AdRSVpgal (1 pfu/cell) and stained 
24 hours post-infection. 

Lines positive for nuclear staining in this assay were then subjected to 
Western blot analysis under denaturing conditions using the same antibody. 
Several lines in which the antibody detected a protein of the expected molecular 
weight (62 kd for the Ad5 fiber protein) were selected for further study including 
211 A, 21 IB and 21 1R. The 211 A cell line has been deposited with ATCC as 
described in Example 3. 
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Immunoprecipitation analysis using soluble nuclear extracts from these three 
cell lines and a seminative electrophoresis system demonstrated that the fiber 
protein expressed is in the functional trimeric form characteristic of the native fiber 
protein as shown in Figure 6. The predicted molecular weight of a trimerized fiber 
is 186 kd. The lane marked 293 lacks fiber while the sublines contain detectable 
fiber. Under denaturing conditions, the trimeric form was destroyed resulting in 
detectable fiber monomers as shown in Figure 6. Those clones containing 
endogenous E1 , newly expressed recombinant E4 and fiber proteins were selected 
for use in complementing adenovirus gene delivery vectors having the 
corresponding adenoviral genes deleted as described in Example 2. 

D. Preparation of an E1 -Expressing Plasmid for 
Complementation of E1 -Gene-Deleted Adenoviruses 



In order to prepare adenoviral packaging cell lines other than those based 
on the E1-gene containing 293 cell line as described in Example 1C above, 
plasmid vectors containing E1 alone or in various combinations with E4 and fiber 
genes are constructed as described below. 

The region of the adenovirus genome containing the Ela and E1b gene is 
amplified from viral genomic DNA by PCR as previously described. The primers 
used are E1L, the 5' orfonward primer, and E1R, the 3' or backward primer, having 
the respective nucleotide sequences 5'CCG AGCTAGC GACTGAAAATGAG3' 
(SEQ ID NO: 10) and 5'CC TCTCGAG AGACAGC AAGACAC3' (SEQ ID NO: 11). 
The E1L and E1R primers include the respective restriction sites Nhel and Xhol as 
indicated by the underiines. The sites are used to clone the amplified El gene 
fragment into the Nhel/Xhol sites in pMAM commercially available from Clontech 
(Palo Alto, CA) to form the plasmid pDEX/E1 having 1 1 152 bp. the plasmid map of 
which is shown in Figure 7. 

The complete nucleotide sequence of pDEX/EI is listed in SEQ ID NO: 12 
where the nucleotide position 1 con-esponds to approximately 1454 nucleotides 
from the 3' end of the pMAM backbone vector sequence. The pDEX/EI plasmid 
includes nucleotides 552 to 4090 of the adenovirus genome positioned 
downstream (beginning at nucleotide position 1460 and ending at 4998 in the 
pDEX/EI plasmid) of the glucocorticoid-inducible mouse mammary tumor virus 
(MMTV) promoter of pMAM. The pMAM vector contains the E. coli gpt gene that 
allows stable transfectants to be isolated using 
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hypoxanthine/aminopterin/thymidine (HAT) selection. The pMAM backbone 
occupies nucleotide positions 1-1454 and 5005-1 1 152 of SEQ ID NO: 12. 

E. Generation of an Adenovirus Paclcaging Cell Line Carrying 
Plasmids Encoding Functional El, and Fiber Proteins 

To create separate adenovirus packaging cell lines equivalent to that of the 
211 sublines, 21 1A, 21 IB and 21 1R. as described in Example 1C, alternative cell 
lines lacking adenoviral genomes are selected for transfection with the plasmid 
constructs as described below. Acceptable host cells include A549, Hela, Vero 
and the like cell lines as described in Example 1. The selected cell line is 
transfected with the separate plasmids, pDEX/EI and pCLF, respectively for 
expressing El, and fiber complementary proteins. Following transfection 
procedures as previously described, clones containing stable insertions of the two 
plasmids are isolated by selection with neomycin and HAT. Integration of full- 
length copy of the El gene is assessed by PGR amplification from genomic DNA 
using the primer set EIUEIR , as described above. Functional insertion of the 
fiber gene is assayed by staining with the anti-fiber antibody as previously 
described. 

The resultant stably integrated cell line is then used as a packaging cell 
system to complement adenoviral gene delivery vectors having the corresponding 
adenoviral gene deletions as described in Example 2. 

Preparation of a Plasmid Containing Two or More Adenoviral 
Genes for Complementing Gene-Deleted 
Adenoviruses 

The methods described in the preceding Examples rely on the use of two 
plasmids, pE4/Hygro and pCLF, or, pCLF and pDEX/EI for generating adenoviral 
cell packaging systems. In alternative embodiments contemplated for use with the 
methods of this invention, complementing plasmids containing two or more 
adenoviral genes for expressing of encoded proteins in various combinations are 
also prepared as described below. The resultant plasmids are then used in various 
cell systems with delivery plasmids having the corresponding adenoviral gene 
deletions. The selection of packaging cell, content of the delivery plasmids and 
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content of the complementing plasmids for use in generating recombinant 
adenovirus viral vectors of tlnis invention thus depends on whether other adenoviral 
genes are deleted along with the adenoviral fiber gene, and, if so, which ones. 

1 . Preparation of a Complementing Plasmid Containing Fiber 

and E1 Adenoviral Genes 

A DNA fragment containing sequences for the CMV promoter, adenovirus 
tripartite leader, fiber gene and bovine growth homnone temiinator is amplified from 
pCLF prepared in Example IB using the fonward primer 
5'GACGGATCGGGAGATCTCC3' (SEQ ID NO: 13), that anneals to the nucleotides 
1-19 of the pCDNAS vector backbone in pCLF, and the backward primer 
S'CCGCCTCAGAAGCCATAGAGCCS' (SEQ ID NO: 14) that anneals to 
nucleotides 1278-1257 of the pCDNAS vector backbone. The fragment is amplified 
as previously described and then cloned into the pDEX/E1 plasmid, prepared in 
Example 1 D. For cloning in the DNA fragment, the pDEX/EI vector is first digested 
with Ndel, that cuts at a unique site in the pMAM vector backbone in pDEX/E1, 
then the ends are repaired by treatment with bacteriophage T4 polymerase and 
dNTPs. 

The resulting plasmid containing El and fiber genes, designated pEI/Fiber, 
provides both dexamethasone-inducible El function as described for DEX/E1 and 
expression of Ad5 fiber protein as described above. A schematic plasmid map of 
pE1 /Fiber, having 14455 bp, is shown in Figure 8. 

The complete nucleotide sequence of pE1/Fiber is listed in SEQ ID NO: 15 
where the nucleotide position 1 corresponds to approximately to 1459 nucleotides 
from the 3' end of the parent vector pMAM sequence. The 5' and 3 ends of the Ad5 
El gene are located at respective nucleotide positions 1460 and 4998 followed by 
pMAM backbone and then separated from the Ad5 fiber from pCLF by the filled-in 
blunt ended Ndel site. The 5' and 3' ends of the pCLF fiber gene fragment are 
located at respective nucleotide positions 10922-14223 containing elements as 
previously described for pCLF. 

The resultant pEI /Fiber plasmid is then used to complement one or more 
delivery plasmids expressing El and fiber. 

The pEI/Fiber construct is then used to transfect a selected host cell as 
described in Example 1E to generate stable chromosomal insertions preformed as 
previously described followed by selection on HAT medium. The stable cells are 
then used as packaging cells as described in Example 2. 
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Preparation of a Complementing Plasmid Containing E4 



and Fiber Adenoviral Genes 



pCLF prepared as described in Example IB is partially digested with Bglll to 
cut only at the site in the pCDNA3 backbone. The pE4/Hygro plasmid prepared in 
Example 1A is digested with BamHI to produce a fragment containing E4. The E4 
fragment is then inserted into the BamHI site of pCLF to form plasmid pE4/Fiber. 
The resultant plasmid provides expression of the fiber gene as described for pOLF 
and E4 function as described for pE4/Hygro. 

A schematic plasmid map of pE4/Fiber, having 10610 bp, is shown in Figure 
9. The complete nucleotide sequence of pE4/Fiber is listed in SEQ ID NO: 16 
where the nucleotide position 1 corresponds to approximately 14 bp from the 3' end 
of the parent vector pCDNAS backbone sequence. The 5* and 3 ends of the Ad5 
E4 gene are located at respective nucleotide positions 21 and 3149 followed by 
fused Bglll/BamHI sites and pCDNA3 backbone including the CMV promoter again 
followed by Bglll/BamHI sites. The adenovirus leader sequence begins at 
nucleotide position 4051 and extends to 4366 followed by fused BamHI/Bglll sites 
and the 5' and 3' ends of the fiber gene located at respective nucleotide positions 
4372 and 6124. 

Stable chromosonai insertions of pE4/Fiber in host cells are obtained as 
described above. 



Adenoviral delivery vectors of this invention are prepared to separately lack 
the combinations of El/fiber and E4/fiber. Such vectors are more replication- 
defective than those previously in use due to the absence of multiple viral genes. 
A preferred adenoviral delivery vector of this invention that is replication competent 
but only via a non-fiber means is one that only lacks the fiber gene but contains the 
remaining functional adenoviral regulatory and structural genes. Furthermore, the 
adenovirus delivery vectors of this invention have a higher capacity for insertion of 
foreign DNA. 



Example 2 



Preparation of Adenoviral Gene Delivery Vectors 
Using Adenoviral Packaging Cell Lines 
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A. Preparation of Adenoviral Gene Delivery Vectors Having 
Specific Gene Deletions and Methods of Use 

To construct the El/ fiber deleted viral vector containing the LacZ reporter 
gene construct, two new plasmids were constructed. The plasmid pA ElBpgal was 
constructed as follows. A DNA fragment containing the SV40 regulatory 
sequences and E. coli p-galactosidase gene was isolated from pSVpgal (Promega) 
by digesting with Vspl. filling the overhanging ends by treatment with Klenow 
fragment of DNA polymerase I in the presence of dNTP's and digesting with Bam 
H1. The resulting fragment was cloned into the EcoRV and BamHI sites in the 
polylinker of pA E1sp1B (Microbix Biosystems, Hamilton, Ontario) to form 
pAEIBpgal that therefore contained the left end of the adenovirus genome with the 
Ela region replaced by the LacZ cassette (nucleotides 6690 to 4151) of pSVpgaL 
Plasmid DNA may be prepared by the alkaline lysis method as described by 
Birnboim and Doly, Nuc. Acids Res., 7:1513-1523 (1978) or by the Quiagen 
method according to the manufacturer's instmction, from transformed ceils used to 
expand the plasmid DNA was then purified by CsCI-ethidium bromide density 
gradient centrifugation. Alternatively, plasmid DNAs may be purified from E. coli by 
standard methods known in the art (e.g. see Sambrook etal,) 

The second plasmid (pDV44), prepared as described herein, is derived from 
pBHGIO, a vector prepared as described by Bett et al., Proc. NatL Acad. Sc/., USA, 
91:8802-8806 (1994), now described in International Application Publication 
Number WO 9500655, with methodology well known to one of ordinary skill in the 
art and also is commercially available from Microbix, which contains an Ad5 
genome with the packaging signals at the left end deleted and the E3 region 
(nucleotides 28133:30818) replaced by a linker with a unique site for the restriction 
enzyme Pad. An 11.9 kb BamHI fragment, which contains the right end of the 
adenovirus genome, is isolated from pBHGIO and cloned into the BamHI site of 
pBS/SK(+) to create plasmid pi 1.3 having approximately 14,658 bp. A schematic 
of the plasmid map is shown in Figure 13. The p11.3 plasmid was then digested 
with Pad and Sail to remove the fiber, E4, and inverted tenninal repeat (ITR) 
sequences. 

This fragment was replaced with a 3,4 kb fragment containing the ITR 
segments and the E4 gene which was generated by PGR amplification from 
pBHGIO using the following oligonucleotide sequences (5' TGTACACCG 
GATCCGGCGCACACC3' SEQ ID NO: 17) and (5'CACAACGAGGTC 
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AATTAATTAATTGCCACATCCTC3' SEQ ID NO: 18). These primers incorporated 
sites for Pad and BamHI. Cloning this fragment into the Pad and blunt ended Sail 
sites of the pi 1.3 backbone resulted in a substitution of the fused ITRs, E4 region 
and fiber gene present in pBHGIO, by the ITRs and E4 region alone. The resultant 
pi 1 .3 plasmid containing the ITR and E4 regions, now called plasmid pDV43a, was 
then digested with BamHI. This BamHI fragment was then used to replace a 
BamHI fragment in pBHGIO thereby creating pDV44 in a pBHGIO backbone. 

In an altemative approach to preparing pDV44 with an additional subcloning 
step to facilitate the incorporation of restriction cloning sites, the following cloning 
procedure was performed. pDV44 as above was constructed by removing the fiber 
gene and some of the residual E3 sequences from pBHGIO (Microbix Biosystems). 
As above, to simplify manipulations, the 11.9 kb BamHI fragment including the 
rightmost part of the Ad5 genome was removed from pBHGIO and inserted into 
pBS/SK. The resulting plasmid was termed pi 1.3. The 3.4 kb DNA fragment 
corresponding to the E4 region and both ITRs of adenovirus type 5 was amplified 
as described above from pBHGIO using the oligonucleotides listed above and 
subcloned into the vector pCR2.1 (Invitrogen) to create pDV42. This step is the 
additional cloning step to facilitate the incorporation of a Sail restriction site. 
pDV42 was then digested with Pad, which cuts at a unique site (bold type) in one 
of the PGR primers, and with Sail, which cuts at a unique site in the pCR2.1 
polylinker. This fragment was used to replace the corresponding Pacl/Xhol 
fragment of pi 1.3 (the pBS polylinker adjacent to the Ad DNA fragment contains a 
unique Xhol site), creating pDV43. Finally, pDV44 was constructed by replacing 
the 11.9 kb BamHI fragment of pBHGIO by the analogous BamHI fragment of 
pDV43- As generated in the first procedure, pDV44 therefore differs from pBHGIO 
by the deletion of Ad5 nucleotides 30819:32743 (residual E3 sequences and all but 
the 3'-most 41 nucleotides of the fiber open reading frame). 

Thus, to summarize, the cloning procedures described above result in the 
production of a fiber-deleted Ad5 genomic plasmid (pDV44) that was constructed 
by removing the fiber gene and some of the residual E3 sequences from pBHGIO 
(Figure 16A). pDV44 contains a wild-type E4 region, but only the last 41 
nucleotides of the fiber ORF (this sequence was retained to avoid affecting 
expression of the adjacent E4 transcription unit). Both pBHGIO and pDV44 contain 
unpackageable Ad5 genomes, and must be rescued by cotransfection and 
subsequent homologous recombination with DNA carrying functional packaging 
signals. In order to generate vectors marked with a reporter gene, either pDV44 or 
pBHGIO was cotransfected with pAE1B3gal, which contains the left end of the Ad5 
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genome with an SV40-driven 6-galactosidase reporter gene inserted in place of the 
El region. 

In general, and as described below, the method for virus production by 
recombination of plasmids followed by complementation in cell culture involves the 
isolation of recombinant viruses by cotransfection of any one of the adenovirus 
packaging cell systems prepared in Example 1, namely 211 A, 21 IB, 21 1R, A549, 
Vero cells, and the like, with plasmids carrying sequences corresponding to viral 
gene delivery vectors. 

A selected cell line is plated in dishes and cotransfected with pDV44 and 
pAE1B3gal using the calcium phosphate method as described by Sett et ai, Proc. 
Natl. Acad ScL, USA, 97:8802-8806 (1994). Recombination between the 
overlapping adenovirus sequences in the two plasmids leads to the creation of a 
full-length viral chromosome where pDV44 and pAEIBRgal recombine to form a 
recombinant adenovirus vector having multiple deletions. The deletion of El and 
of the fiber gene from the viral chromosome is compensated for by the sequences 
integrated into the packaging cell genome, and infectious virus particles are 
produced. The plaques thus generated are isolated and stocks of the recombinant 
virus are produced by standard methods. 

A pDV44-derived virus is expected to be replication-defective due to the 
fiber deletion, so that the cells in which it is grown must complement this defect. 
The 21 1 B cell line (a derivative of 293 cells which expresses the wild-type (wt) ADS 
fiber and is equivalent to 21 1 A on deposit with ATCC as described in Example 3) 
was used for rescue and propagation of the virus described here, pDV44 and 
pAE1B3gal were cotransfected into 21 IB cells, and the monolayers were observed 
for evidence of cytopathic effect (CPE). Briefly, for virus construction, cells were 
transfected with the indicated plasmids using the Gibco Calcium Phosphate 
Transfection system according to the manufacturer's instructions and observed 
daily for evidence of CPE. 

One of a total of 58 transfected dishes showed evidence of spreading cell 
death at day 15. A crude freeze-thaw lysate was prepared from these cells and the 
resulting virus (termed Ad5.Rgal.AF) was plaque purified twice and then expanded. 
To prepare purified viral preparations, cells were infected with the indicated Ad and 
observed for completion of CPE. Briefly, at day zero, 21 1B cells were plated in 
DMEM plus 10% fetal calf serum at approximately 1X10^ cells/150 cm^ flask or 
equivalent density. At day one, the medium was replaced with one half the original 
volume of fresh DMEM containing the indicated Ad, in this case Ad5.l3gal.AF. at 
approximately 100 particles/cell. At day two, an equal volume of medium was 
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added to each flask and the cells were observed for CPE. Two to five days after 
infection, cells were collected and virus isolated by lysis via four rapid freeze-thaw 
cycles. Virus was then purified by centrifugation on preformed 15-40% CsCI 
gradients (111,000 x g for three hours at 4^*0). The bands were harvested, 
dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% NaCI, and 10% glycerol), 
aliquoted and stored at - 70°C. Purified Ad5.l3gal.AF virus particles containing 
human adenovirus Ad5.3gal. AFgenome (described further below) have been 
deposited with the ATCC on January 15, 1999 as further described in Example 3. 

For viral titering, as necessary in the below Examples, Ad preparations were 
titered by plaque assay on 21 1B cells. Cells were plated on polylysine-coated 6 
well plates at 1.5 x 10^ cells/welL Duplicate dilutions of virus stock were added to 
the plates in 1 ml/well of complete DMEM. After a five hour incubation at 37°C, 
virus was removed and the wells overlaid with 2 ml of 0.6% low-melting agarose in 
Medium 199 (Gibco). An additional 1 ml of overlay was added at five day intervals. 

As a control, the first-generation virus Ad5.B gal.wt, which is identical to 
AdS.Bgal. AF except for the fiber deletion, was constructed by cotransfection of 
pBHGIO and pAEIBBgal (Figure 168). In contrast to the low efficiency of recovery 
of the fiberless genome (1/58 dishes), all of 9 dishes cotransfected with pAEIBfigal 
and pBHGIO produced virus. 

In a preferred embodiment of this invention as more fully described herein 
and below, a delivery plasmid is prepared that does not require the above- 
described recombination events to prepare a viral vector having a fiber gene 
deletion. In one embodiment, a single delivery plasmid containing all the 
adenoviral genome necessary for packaging but lacking the fiber gene is prepared 
from plasmid pFG140 containing full-length Ad5 that is commercially available from 
Microbix. The resultant delivery plasmid referred to as pFG140-f is then used with 
pCLF stably integrated cells as described above to prepare a viral vector lacking 
fiber. In a preferred aspect of this invention, the fiber gene is replaced with a 
therapeutic gene of interest for preparing a therapeutic delivery adenoviral vector. 
Other embodiments including production of fiberless vector with a complete TPL 
are described in Example 5. 

Vectors for the delivery of any desired gene and preferably a therapeutic 
gene are prepared by cloning the gene of interest into the multiple cloning sites in 
the polylinker of commercially available pAEIsplB (Microbix Biosystems), in an 
analogous manner as performed for preparing p ElBpgal as described above. The 
same cotransfection and recombination procedure is then followed as described 
herein to obtain viral gene delivery vectors as further discussed in later Examples. 
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Characterization of the Ad5.pgal.AF Genome 



To confirm that the vector genomes had the expected structures and that the 
fiber gene was absent from the Ad5.BgaI.CDF chromosome, the DNA isolated from 
viral particles was analyzed. Briefly, purified viral DNA was obtained by adding 10 
pi of 10 mg/ml proteinase K, 40 pi of 0.5 M EDTA and 50 pi of 10% SDS to 800 pi 
of adenovirus-containing culture supernatant. The suspension was then incubated 
at 55C for 60 minutes. The solution was then extracted once with 
400 pi of a 24:1 mixture of chloroform:isoamyl alcohol. The aqueous phase was 
then removed and precipitated with sodium acetate/ethanol. The pellet was 
washed once with 70% ethanol and lightly dried. The pellet was then suspended in 
40 pi of 10 mM Tris-HCI, pH 8.0, 1 mM EDTA. Genomic DNA from both 
Ad5.Bgal.wt and Ad5.Pgal.AF produced the expected restriction patterns (Figure 
17A) following digestion with either EcoRI (Figure 17B) or with Ndel (data not 
shown). Southern blotting, performed with standard methods, with labeled fiber 
DNA as a probe demonstrated the presence of fiber sequence in Ad5.Bgal.wt but 
not in AdS.Bgal. AF DNA (Figure 17C). As a positive control, the blot was stripped 
and reprobed with labeled E4 sequence. Fiber and E4 sequences were detected 
by using labeled inserts from pCLF and pE4/Hygro, respectively. As expected, E4 
signal was readily detectable in both genomes at equal intensities (Figure 17C). 

The complete nucleotide sequence of Ad5.pgal.AF is presented in SEQ ID 
NO: 27 and is contained in the virus particle on deposit with ATCC. 



To verify that Ad5.Bgal.AF was fiber-defective, 293 cells (which are 
permissive for growth of El -deleted Ad vectors but do not express fiber) were 
Infected with AdS.Bgal. AF or with Ad5.3gal.wt. Twenty-four hours post infection, 
the cells were stained with polyclonal antibodies directed either against fiber or 
against the penton base protein. Ceils infected with either virus were stained by 
the anti-penton base antibody, while only cells infected with the Ad5,Bgal.wt control 
virus reacted with the anti-fiber antibody. This confirms that the fiber-deleted Ad 
mutant does not direct the synthesis of fiber protein. 



2, 



Characterization of the Fiberless Adenovirus Ad5.pgat.AF 



3. Growth of the Fiber-Deleted Ad5.pgal.AF Vector in 
Complementing Cells 
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Ad5.pgal.AF was found to readily be propagated in 21 1B cells. As assayed 
by protein concentration, CsCl-purified stocks of either Ad5.pgal.AF or Ad5.Bgal.wt 
contained similar numbers of viral particles (Table 1), and the particles appeared to 
band normally on CsCI gradients. However, infectivity of the Ad5.pgal.AF particles 
was lower than the AdS.Qgal.wt control, as indicated by an increased particle/PFU 
ratio (Table 1). This is likely due to a reduced amount of fiber protein incorporated 
into mutant particles during growth in the 
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Virus 


CsCI- 
purified 
prepn 


Cell 
line 


Particles/ 




Particle 
/PFU 
ratio 


Fiber source 


AdS.pgal.wt 


1 


211 
B 


7.4 X 10" 


7.5 X 
10^° 


10 


Ad 

chromosom 
e 




2 


211 
B 


3.0 X 10'' 


5.0x10^ 


60 


Ad 

chromosom 
e 


AdS.pgaLAF 


3 


211 
B 


7.7 X 10'' 


3.5 x 10^ 


2200 


Packaging 
cells 




4 


211 
B 


1.9 X 10"" 


2.3 X 10^ 


808 


Packaging 
cells 




5 


293 


4.5 X 10" 


9.5 X 10** 


47400 


None 




6 


293 


3.4 X 10" 


3.5 X lO'' 


9700 


None 



^Calculated from viral protein concentration (lug of protein = 4x10® particles). 
^Assayed by plaguing on 21 1 B cells. 

Table 3 

*Partlcle numbers and infectious titers of representative adenovirus preps. Each 
line represents a single CsCl-purified preparation of the indicated virus. Particle numbers 
were calculated from viral protein concentration (1 ijg protein = 4x10® particles). Pfu w/as 
assayed by plaquing on 21 IB cells (see above). 

21 IB cells (see below). Ad5.pgal.AF was also found to plaque more slowly than the 
control virus. When plated on 2118 cells. AdS.Rgal.wt plaques appeared within 5-7 
days, while plaques of Ad5.pgal.AF continued to appear until as much as 15-18 
days post infection. Despite their slower formation, the morphology of AdS.pgaLAF 
plaques was essentially normal. 
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4. Production of Fiberless AdS.Bgal. AF Particles 

As Ad5.pgal.AF represents a true fiber null mutation and its stocks are free of 
helper virus, the fiber mutant phenotype was readily investigated. A single round of 
growth in cells (such as 293) which do not produce fiber generating a homogeneous 
preparation of fiberless Ad allowed for the determination of whether such particles 
would be stable and/or infectious. Either Ad5.Bgal.wt or Ad5.pgal.AF was grown in 
293 or 21 1B cells, and the resulting particles purified on CsCI gradients as 
previously described. Ad5.pgal.AF particles were readily produced in 293 cells at 
approximately the same level as the control virus and behaved similarly on the 
gradients, indicating that there was not a gross defect in morphogenesis of fiberless 
capsids (Table 1). 

As shown in Figure 18, particles of either virus contained similar amounts of 
penton base regardless of the cell type in which they were grown. This 
demonstrated that fiber is not required for assembly of the penton base complex into 
virions. However, as predicted, the Ad5.pgal.AF particles produced in 293 cells did 
not contain fiber protein. 21 1 B-grown Ad5.pgal.AF also contained less fiber than 
the Ad5.Bgal.wt control virus (Figure 18). Importantly, the infectivities of the 
different viral preparations on epithelial cells (Table 1) correlated with the amount of 
fiber protein present. The fiberless Ad particles were several thousand-fold less 
infectious than the first-generation vector control on a per-particle basis, while 
infectivity of 21 1 B-grown Ad5.pgal.AF was only 50-100 fold less than that of 
Ad5.Bgal.wt. These studies confirmed fiber's crucial role in 
infection of epithelial cells via CAR binding. 

5. Composition and Structure of the Fiberless AdS.Dgal. 
AF Particles 

The proteins contained in particles of 293-grown Ad5.pgal.AF were compared 
to those in Ad5.Bgal.wt, to detemnine whether proteolysis or particle assembly was 
defective in this fiber null mutant (data not shown). The overall pattern of proteins in 
the fiberless particles was observed to be quite similar to that of a first-generation 
vector, with the exception of reduced intensity of the composite band resulting from 
both proteins Ilia and IV (fiber) (data not shown). The fiberless particles also had a 
reduced level of protein VII. Although substantial amounts of uncleaved precursors 
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to proteins VI. VII, and VIII were not seen, it is possible that the low-molecular weight 
bands migrating ahead of protein VII represent either aberrantly cleaved viral 
proteins or their breakdown products. 

Cryo-eiectron nnicroscopy was used to more closely examine the structure of 
the 293 grown Ad5.pgal.AF and of AdSBgal.wt. The fiber, which consists of an 
extended stalk with a knob at the end» was faintly visible in favorable orientations of 
wild-type Ad5 particles, but not in images of the fiberless particles (data not shown). 
Filamentous material likely corresponding to free viral DNA was seen in micrographs 
of fiberless particles. This material was also present in micrographs of the 
first-generation control virus, albeit at much lower levels. 

Three-dimensional image reconstructions of fiberless and wild-type particles 
at -20 A resolution showed similar sizes and overall features, with the exception that 
fiberless particles lacked density corresponding to the fiber protein. The densities 
corresponding to other capsid proteins, including penton base and proteins Ilia. VI, 
and IX, were comparable in the two structures. This confirms that absence of fiber 
does not prevent assembly of these components into virions. The fiber was 
truncated in the wild-type structure as only the lower portion of its flexible shaft 
follows icosahedral symmetry. The RGD protrusions on the fibertess penton base 
were angled slightly inward relative to those of the wild-type structure. Another 
difference between the two penton base proteins was that there is a -30 A diameter 
depression in the fiberless penton base around the five-fold axis where the fiber 
would normally sit. The Ad5 reconstructions confirm that capsid assembly, 
including addition of penton base to the vertices, is able to proceed In the complete 
absence of fiber. 

6. Integrin-Dependent Infectivity of Fiberless AdS.Dgal. AF 
Particles 

While attachment via the viral fiber protein is a critical step in the infection of 
epithelial cells, an alternative pathway for infection of certain hematopoietic cells has 
been described. In this case, penton base mediates both binding to the cells (via B2 
integrins) and internalization (through interaction with avintegrins). Particles lacking 
fiber might therefore be expected to be competent for infection of these cells, even 
though on a per-particle basis they are several thousand-fold less infectious than 
normal Ad vectors on epithelial cells. 
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To investigate this, THP-1 monocytic cells were infected with Ad5.Bgal.wt or 
with Ad5.pgal.AF grown in the absence of fiber. Infection of THP-1 cells was 
assayed by infecting 2 x 10' cells at the indicated m.o.i. in 0.5 ml of complete RPMI. 
Forty-eight hours post-infection, the cells were fixed with glutaraldehyde and stained 
with X-gal, and the pert;entage of stained cells was determined by light microscopy. 

The results of the infection assay showed that the fiberless particles were 
only a few-fold less infectious than first-generation Ad on THP-1 cells (Figure 19A). 
In contrast to this, very large differences were seen in plaquing efficiency on 
epithelial (21 IB) cells (Table 1). Infection of THP-1 cells by either Ad5.pgal.AF or 
Ad5.Bgal.wt was not blocked by an excess of soluble recombinant fiber protein, but 
could be Inhibited by the addition of recombinant penton base (Figure 19B). These 
results indicate that the fiberless Ad particles use a fiber-independent pathway to 
infect these cells. Furthennore. the lack of fiber protein did not prevent Ad5.pgal.AF 
from internalizing into the cells and delivering its genome to the nucleus, 
demonstrating that fiberless particles are properly assembled and are capable of 
uncoating. 

The foregoing results with the recombinant viruses thus produced indicates 
that they can be used as gene delivery tools both in cultured cells and in vivo as 
described more fully in the Examples. For example, for studies of the effectiveness 
and relative immunogenicity of multiply-deleted vectors, virus particles are produced 
by growth in the packaging lines described in Example 1 and are purified by CsCI 
gradient centrifugation. Following titering, virus particles are administered to mice 
via systemic or local injection or by aerosol delivery to lung. The LacZ reporter gene 
allows the number and type of cells which are successfully transduced to be 
evaluated. The duration of transgene expression is evaluated in order to determine 
the long-term effectiveness of treatment with multiply-deleted recombinant 
adenoviruses relative to the standard technologies which have been used in clinical 
trials to date. The immune response to the improved vectors described here is 
determined by assessing parameters such as inflammation, production of cytotoxic T 
lymphocytes directed against the vector, and the nature and magnitude of the 
antibody response directed against viral proteins. 

Versions of the vectors which contain therapeutic genes such as CFTR for 
treatment of cystic fibrosis or tumor suppressor genes for cancer treatment are 
evaluated in the animal system for safety and efficiency of gene transfer and 
expression. Following this evaluation, they are used as experimental therapeutic 
agents in human clinical trials. 
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B. Retargeting of Adenoviral Gene Delivery Vectors by Producing 

Viral Particles Containing Different or Altered Fiber Proteins 

As the specificity of adenovirus binding to target cells is largely determined by 
the fiber protein, viral particles that incorporate modified fiber proteins or fiber 
proteins from different adenoviral serotypes (pseudotyped vectors) have different 
specificities. Thus, the methods of expression of the native Ad5 fiber protein in 
adenovirus packaging cells as described above is also applicable to production of 
different fiber proteins. 

In one aspect of invention, chimeric fiber proteins are produced according to 
the methods of Stevenson a/., J. Virol., 65:2850-2857 (1995). The authors 
showed that the determinants for fiber receptor binding activity are located in the 
head domain of the fiber and that isolated head domain is capable of trimerization 
and binding to cellular receptors. The head domains of adenovirus type 3 (Ad3) and 
Ad5 were exchanged in order to produce chimeric fiber proteins. Similar constructs 
for encoding chimeric fiber proteins for use in the methods of this invention are 
contemplated. Thus, instead of the using the intact AdS fiber-encoding construct 
prepared in Example 1 as a complementing viral vector in adenoviral packaging 
cells, the constructs described herein are used to transfect cells along with E4 
and/or E1 -encoding constructs. 

Briefly, full-length AdS and AdS fiber genes were amplified from purified 
adenovirus genomic DNA as a template. The AdS and AdS nucleotides sequences 
are available with the respective GenBank Accession Numbers M18369 and 
M1241 1 . Oligonucleotide primers are designed to amplify the entire coding 
sequence of the full-length fiber genes, starting from the start codon, ATG, and 
ending with the termination codon TAA. For cloning purposes, the 5' and 3' primers 
contain the respective restriction sites BamHI and NotI for cloning into pcDNA 
plasmid as described in Example 1A. PGR is performed as described above. 

The resultant products are then used to construct chimeric fiber constructs by 
PGR gene overlap extension, as described by Horton etaL, BioTechniques, S:525- 
535 (1990). The AdS fiber tail and shaft regions (STS; the nucleotide region 
encoding amino acid residue positions 1 to 403) are connected to the AdS fiber head 
region (3H; the nucleotide region encoding amino acid residue positions 136 to 319) 
to form the 5TS3H fiber chimera. Gonversely, the AdS fiber tail and shaft regions 
(STS; the nucleotide region encoding amino acid residues positions 1 to 135) are 
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connected to the Ad5 fiber head region (5H; the nucleotide region encoding the 
amino acid residue positions 404 to 581) to form the 3TS5H fiber chimera. The 
fusions are made at the conserved TLWT (SEQ ID NO: 19) sequence at the fiber 
shaft-head junction. 

The resultant chimeric fiber PGR products are then digested with BamHI and 
NotI for separate directional ligation into a similarly digested pcDNA 3.1 . The TPL 
sequence is then subcloned into the BamHI as described in Example 1 A for 
preparing an expression vector for subsequent transfection into 211 cells as 
described above or into the alternative packaging cell systems as previously 
described. The resultant chimeric fiber construct-containing adenoviral packaging 
cell lines are then used to complement adenoviral delivery vectors as previously 
described. Other fiber chimeric constructs are obtained using a similar approach 
with the various adenovirus serotypes known. 

In an alternative embodiment, the methods of this invention contemplate the 
use of the modified proteins including novel epitopes as described by Michael et al.. 
Gene Therapy, 2:660-668 (1995) and in International Publication WO 95/26412. 
Both publications describe the construction of a cell-type specific therapeutic viral 
vector having a new binding specificity incorporated into the virus concurrent with 
the destruction of the endogenous viral binding specificity. In particular, the authors 
described the production of an adenoviral vector encoding a gastrin releasing 
peptide (GRP) at the 3' end of the coding sequence of the AdS fiber gene. The 
resulting fiber-GRP fusion protein was expressed and shown to assemble functional 
fiber trimers that were correctly transported to the nucleus of HeLa cells following 
synthesis. 

Based on the teachings in the paper and International Publication, similar 
constructs are contemplated for use in the complementing adenoviral packaging cell 
systems of this invention for generating new adenoviral gene delivery vectors that 
are targetable, replication-deficient and less immunogenic. Heterologous ligands 
contemplated for use herein to redirect fiber specificity range from as few as 10 
amino acids in size to large globular structures, some of which necessitate the 
addition of a spacer region so as to reduce or preclude steric hindrance of the 
heterologous ligand with the fiber or prevent trimerization of the fiber protein. The 
ligands are inserted at the end or within the linker region. Preferred ligands include 
those that target specific cell receptors or those that are used for coupling to other 
moieties such as biotin and avidin. 
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A preferred spacer includes a short 12 amino acid peptide linker composed of 
a series of serines and alanine flanked by a proline residue at each end. One of 
ordinary skill in the art is familiar with the preparation of linkers to accomplish 
sufficient protein presentation and for altering the binding specificity of the fiber 
protein without compromising the cellular events that follow viral internalization. 
Moreover, within the context of this invention, preparation of modified fibers having 
ligands positioned internally within the fiber protein and at the carboxy terminus as 
described below are contemplated for use with the methods described herein. 

The preparation of a fiber having a heterologous binding ligand is prepared 
essentially as described in the above-cited paper. Briefly, for the ligand of choice, 
site-directed mutagenesis is used to insert the coding sequence for a linker into the 
3' end of the Ad5 fiber construct in pCLF as prepared in Example 1 . 

The 3' or antisense or mutagenic oligonucleotide encodes a preferred linker 
sequence of ProSerAlaSerAlaSerAlaSerAlaProGlySer (SEQ ID NO: 20) followed by 
a unique restriction site and two stop codons, respectively, to allow the insertion of a 
coding sequence for a selected heterologous ligand and to ensure proper translation 
termination. Flanking this linker sequence, the mutagenic oligonucloetide contains 
sequences that overlap with the vector sequence and alio its incorporation into the 
construct. Following mutagenesis of the pCLF sequence adding the linker and stop 
codon sequences, a nucleotide sequence encoding a preselected ligand is obtained, 
linkers corresponding to the unique restriction site in the modified construct are 
attached and then the sequence is cloned into linearized corresponding restriction 
site. The resultant fiber-ligand construct is then used to transfect 21 1 or the 
alternative cell packaging systems previously described to produce complementing 
viral vector packaging systems for use with the methods of this invention. 

In a further embodiment, intact fiber genes from different Ad serotypes are 
expressed by 21 1 cells or an alternative packaging system as previously described. 
A gene encoding the fiber protein of interest is first cloned to create a plasmid 
analogous to pCLF, and stable cell lines producing the fiber protein are generated 
as described above for Ad5 fiber. The adenovirus vector described which lacks the 
fiber gene is then propagated in the cell line producing the fiber protein relevant for 
the purpose at hand. As the only fiber gene present is the one in the packaging 
cells, the adenoviruses produced contain only the fiber protein of interest and 
therefore have the binding specificity conferred by the complementing protein. Such 
viral particles are used in studies such as those described above to detemnine their 
properties in experimental animal systems. 
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Example 3 
Deposit of Materials 

The following cell lines and plasmids have been deposited on September 25, 
1996, with the American Type Culture Collection, 10801 University Blvd, Manassas, 
Virginia, USA (ATCC) under the provisions of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms for the Purpose of Patent 
Procedure and the Regulations thereunder (Budapest Treaty): PI asmid pE4/Hygro 
(accession number 97739). Plasmid pCLF (accession number 97737), 21 1 Cell Line 
(accession number CRL-12193) and 21 1 A Cell Line (accession number CRL-12194) 

The following virus, Ad5.pgal.AF , deposit was deposited on January 15, 
1999, with the ATCC as listed above and provided with accession number VR2636. 

Additionally, plasmids pDV60, pDV67, pDV69. pDVBO and pDV90 were also 
deposited at the ATCC on 5 January. 2000 and provided with accession numbers 
PTA-1144, PTA-1145. PTA-1146, PTA-1147 and PTA-1148 respectively. 

Example 4 

Complementation of Fiber-Defective and Fiber-Modified Virus 

The native fiber protein is a homotrimer (Henry LJ. ef a/., J, ViroL 
58:5239-5246 (1994)), and trimerization is essential for assembly of the penton/fiber 
complex (Novelli A ef a/., J. Bioi Chem. 256:9299-9303 (1991)). To assess the 
multimeric structure of the recombinant fiber protein produced by the cell lines, cells 
were labeled with 50 pCi/ml [^S] Translabel (ICN) for two hours at 37°C» lysed in 
RlPA buffer, and fiber protein was immunopreclpitated as described (Harlow E et a., 
Antibodies. Cold Spring Harbour Laboratory, Cold Spring Harbor (1988). Immune 
complexes were collected on Protein A-Sepharose beads (Pierce), extensively 
washed with RIPA buffer, and incubated at room temperature in 0.1 M triethylamine, 
pH 1 1 .5 to release bound fiber protein, A portion of the precipitated fiber was 
electrophoresed on a 8% SDS-PAGE gel under denaturing (1% SDS in loading 
buffer, samples boiled for 5 minutes) or semi-native (0.1% SDS in loading buffer, 
samples not heated) conditions. 

As seen in Fig. 13, lines 21 1 A, 21 1 B, and 21 1 R, but not the control 293 cells, 
expressed an immunologically reactive protein which migrated at the predicted 
molecular weight for trimer (1 86 kD) under seminative conditions and for monomer 
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(62 kD) under denaturing conditions. The behavior of the precipitated fiber was 
indistinguishable from that of purified baculovirus-produced recombinant Ad2 fiber 
(Wickham T et aL, Cell 73:309-319 (1993)) (the 58 kD Ad2 and 62 kD Ad5 fibers 
have very similar mobilities under these conditions). 

To determine whether the fiber-expressing lines could support the growth of a 
fiber-defective adenovirus, we performed one-step growth experiments using the 
temperature-sensitive fiber mutant Ad H5fs142 (the gift of Harold Ginsberg). At the 
restrictive temperature (Sg.S'^C), this mutant produces an underglycoslyated fiber 
protein which is not incorporated into mature virions (Chee-Sheung C. C ef a/,, J. 
Virol 42: 932-950 (1982)). This results in the accumulation of non-infectious viral 
particles. We asked whether the recombinant fiber protein expressed by our cell 
lines could complement the H5te142 defect and rescue viral growth. 

Cell lines 293, 21 1 A, 21 1 B and 21 1 R (2 x 10® cells/sample) were infected with 
H5fs142 at 10 pfu/cell. 48 hours later, cells were detached with 25 mM EDTA and 
virus was harvested by four rapid freeze-thaw cycles. Debris was removed by a 10 
minute spin at 1500 x g, and viral titers determined by fluorescent focus assay (Thiel 
J.F ef a/., Proc. Soc. Exp. Biol. Med, 725:892-895 (1967)) on SW480 cells with a 
polyclonal anti-penton base Ab (Wickham T et aL, Cell 75:309-31 9 (1 993)). As 
shown in Fig. 14, the fiber mutant virus replicated to high titers in 293 cells at 32.5°C 
(the permissive temperature), but to a much lower extent at the restrictive 
temperature of 39.5°C. The fiber-producing packaging lines 21 1 A, 21 1 B, or 21 1 R 
supported virus production at 39**C to levels within two- to three-fold of those seen at 
the permissive temperature in 293 cells, indicating that these cells provided partial 
complementation of the fiber defect. 

Interestingly, virus yields from the fiber-producing cell lines were also 
somewhat higher than those from 293 cells at 32.5°C (the 'permissive' temperature). 
This suggests that fiber produced by the ts142 virus may be partially defective even 
at the permissive temperature. Alternatively, a non-specific increase in adenoviral 
titer could result when viruses are grown in the packaging cells, by a mechanism not 
involving fiber complementation. However, it was found that viruses with wild type 
fiber genes (such as Ad.RSVPgal) replicate to identical levels either in our 
packaging lines or in 293 cells (data not shown). Taken together, these results 
demonstrate that the observed increase in H5te142 growth is due to specific 
complementation of the fiber mutation. 

Even in the fiber-expressing cell lines, the fiber mutant grows to higher titers at 
32°C than at Sg.S'^C. This incomplete complementation may be due to the 
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packaging lines' expression of fiber at a level somewhat below that seen in a wild- 
type infection (data not shown). A recent study reported an E4-deleted vector which 
colncidentally reduced fiber protein expression, resulting in a large reduction in the 
titer of virus produced (Brough ef a/., J. Virol. 70:6497-6501 (1996)), Another 
possibility is that the defective fs142 fiber protein produced at the restrictive 
temperature might form complexes with some of the wild type protein produced by 
the cells and prevent its assembly into particles. 

Although the fiber proteins of different Ad serotypes differ in the length of their 
shaft domains and in their receptor-binding knob domains, the N-terminal regions 
responsible for interaction with the viral penton base are highly conserved Amberg 
N ef a/., Virology 2272Z9'24A (1997)) (Figure 15A). This suggests that fibers from 
many viral serotypes, with their different cell-binding specificities, may be amenable 
for use in producing gene delivery vectors. 

In order to determine whether the recombinant Ad5 fiber produced by the 
packaging cells could be incorporated into particles of another adenovirus serotype, 
adenovirus type 3 was grown either in fiber-producing cell lines or in 293 cells. Viral 
particles were purified by two sequential centrifugations (3 h at 11 1 ,000 x g) on 
preformed 15-40% CsCI gradients to remove soluble cellular proteins and then 
dialyzed extensively against 10 mM Tris-HCI. pH 8.1, 150 mM NaCI, 10% glycerol. 
AdS fiber protein was detected by immunoblotting using the polyclonal anti-fiber 
serum, followed by detection with a horseradish peroxidase-conjugated goat anti- 
rabbit antibody (Kirkegaard and Perry Laboratories) and the ECL 
chemiluminescence substrate (Amersham). The purified Ad3 particles contained 
AdS fiber protein after a single passage through a fiber-expressing cell line but not 
after passage through 293 cells 

(Figure 15B). Previous work has demonstrated that Ad2 fiber is capable of 
interacting in vitro with Ad3 penton base {Fender et aL, Nature Biotech. 75:52-56 
(1997)), and our result demonstrates that the type 5 fiber protein produced by the 
cells is capable of assembling into complete Ad3 particles. 

A vector based on AdS but containing the gene for the Ad7 fiber protein has 
been described (Gall J. ef a/., J. ViroL 70:2116-2123 (1996)), as well as Ads 
containing chimeric fiber genes (Krasnykh etaL, J. ViroL 70:6839-6846 (1996) and 
Stevenson etaL, J, ViroL 69.2850-2857 (1995)). Chimeric Ad5/Ad3 vectors have 
also been reported (Stevenson, S. etaL, J. ViroL 77:4782-4790, (1997). Addition of a 
short peptide linker to the fiber in order to confer binding to a different cellular 
protein has also been reported (Michael etaL, Gene Therapy 2.-660-668 (1995). By 
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using packaging technology such as that presented here, Ad vectors equipped with 
different fiber proteins nnay be produced simply by growth in cells expressing the 
fiber of interest, without the time-consuming step of generating a new vector genome 
for each application. 

Replacing or modifying the fiber gene in the vector chromosome would also 
require that the new fiber protein bind a receptor on the surface of the cells it which it 
is to be grown. The packaging cell approach will allow the generation of Ad particles 
containing a fiber which can no longer bind to its host cells, by a single round of 
growth in cells expressing the desired fiber gene. This will greatly expand the 
repertoire of fiber proteins which can be incorporated into particles, as well as 
simplifying the process of retargeting gene delivery vectors. 

Finally, a novel fiber-independent pathway of infection has recently been 
described in hematopoietic cells, in which penton base provides the initial virus-cell 
interaction by binding to integrin ambz (Huang S. ef a/., J. Virol 70: 4502-4508 
(1996)). This suggests that viral particles lacking fiber protein may be useful in 
targeting gene delivery to specific cell types via this pathway. 



Preparation of Alternative TPLs 

The present invention contemplates the use of tripartite leader sequences 
(TPLs) that are useful in enhancing the expression of complementing adenoviral 
proteins, particularly fiber protein, for use in preparing an adenoviral gene delivery 
vector. One preferred TPL is the complete Ad5 tripartite leader contained in 
complementing vectors such as pDV67 and pDV69, both of which are prepared as 
described below. The complete Ad5 TPL was constructed by assembling PGR 
fragments. First, the third TPL exon (exon 3) (nt 9644-9731 of the Ad5 genome) was 
amplified from AdS genomic DNA using the synthetic oligonucleotide primers 
5'CTCAACAATTGTGGATCCGTACTCC3'(SEQ ID NO: 28) and 
5'GTGCTGAGCAGATCTTGCGACTGTG3' (SEQ ID NO: 29). The resulting product 
was cloned to the BamHi and Bglll sites of pAElSpla (Microbix Biosystems) using 
novel sites in the primers (shown in bold) to create plasmid pDV52. A fragment 
corresponding to the first TPL exon (exon 1), the natural first intron (intron 1), and 
the second TPL exon (exon 2) (AdS nt 6049-7182) was then amplified using primers 
5'GGCGCGTTCGGATCCACTCTCTTCC3' (SEQ ID NO:30) and 5'CTA 
CATGCTAGGCAGATCTGGTTCGGAG3' (SEQ ID NO: 31), and cloned into the 
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BamHI site of pDV52 (again using novel sites in the primers) to create pDV55. 
This plasmid contains a 1 .2 kb BamHI/Bglll fragment consisting of the first TPL exon, 
the natural first intron, and the fused second and third TPL exons. The nucleotide 
sequence of the complete TPL containing the noted 5' and 3' restriction sites is 
shown in SEQ ID NO: 32 with the following nucleotide regions identified: 1-6 nt 
BamHI site; 7-47 nt first leader segment (exon 1); 48-1068 nt natural first intron 
(intron 1); 1069-1 140 nt second leader segment (exon 2); 1 141-1 146 nt fused BamHI 
and Bglll sites; 1 147-1234 nt third leader segment (exon 3); and 1235-1240 nt Bglll 
site, 

TPLs fragments containing two of the three exons, exons in non-native order, 
or containing either the first or second TPL intron are also constructed for use in 
preparing complementing plasmids for use in the methods of the present invention. 
Briefly, DNA fragments containing any combination of 2 TPL exons can be 
constructed as follows: Exon 1 is amplified from genomic DNA as prepared above 
by using the oligonucleotides 5'GGCGCGTTCGGATCCACTCTCTTCC3'(SEQ ID 
NO: 33) and 5'GGGAGTAGATCTCCCAACAG3* (SEQ ID NO: 34). Exon 2 is 
similarly amplified from the same genomic DNA using oligonucleotides 
5'CCC I I I I I I I I GGATCCCTCGCGG3* (SEQ ID NO: 35) and 
5'CTACATGCTAGGCAGATCTGGTTCGGAG3' (SEQ ID NO: 36). Exon 3 is 
amplified using the oligonucleotides 5'CTCAACAATTGTTGGATCCGTACTCC3' 
(SEQ ID NO: 37) and 5'GTGCTCAGCAGATCTTGCGACTGTG3' (SEQ ID NO: 38). 

The amplified exons are ligated together in any desired number and/or order by 
virtue of the unique BamHI and Bglll restriction sites (bold) in the primers for 
subsequent ligation into a construct analogous to pDV67, prepared as described 
below, for expression of viral structural genes. 

Similarly, a fragment consisting of the first TPL exon (exon 1), the native first 
intron (intron 1), and the second TPL exon (exon 2) is produced by amplification 
from AdS genomic DNA with the oligonucleotide pair 5'GGCGCGTTCGGATCC 
ACTCTCTTCC3' (SEQ ID NO: 39) and 5'CTACATGCTAGGCAGATCT 
CGTTCGGAG3' (SEQ ID NO: 40). Finally, a fragment consisting of the second TPL 
exon (exon 2), the native second intron (intron 2), and the third TPL exon (exon 3) is 
produced by amplification using the oligonucleotides 5'CCC i i i i i i i i GGATCC 
CTCGCGG3' (SEQ ID NO: 41) and 5'GTGCTCAGCAGATCTTGCGACTGTG3' 
(SEQ ID NO: 42). Either of the intron-containing fragments are used either alone or 
in combination with another TPL fragment(s) in constructs analogous to pDV67. 
Introns in addition to adenoviral intron 1 used herein that have been shown to 
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increase the expression of recombinant proteins when included in expression 
constructs include SV40 VP1 intron, rabbit p-globin intron among others. The use of 
these alternative intron sequences are contemplated for use in preparing a TPL in 
the present invention. 

Example 6 

Preparation and Use of Adenoviral Packaging Cell Lines 
Containing Plasmids Containing Alternative TPLs 

Plasmids were first constructed as described below that contained TPLs are 
described above. The resultant plasmids containing different selectable markers 
such as neomycin or zeocin were then used to prepare stable cell lines for use as 
complementing vectors for preparing adenoviral vectors for use in the present 
invention. In a preferred embodiment, the resulting cell lines represent 
improvements over preexisting fiber-complementing cell lines in that fiber expression 
is enhanced with the use of alternative TPLs. 

A, pDV60 

pDV60 was constructed by inserting this TPL cassette of SEQ ID NO: 32 into 
the BamHI site upstream of the Ad5 fiber gene in pcDNA3/Fiber, a neomycin 
selectable plasmid, prepared as described in Example 1 and also as described by 
Von Seggern et aL, J. Gen Virol., 79: 1461 (1998). The nucleotide sequence of 
pDV60 is listed in SEQ ID NO: 43. 

B. pDV61 

To construct pDV61, an Asp718/Notl fragment containing the CMV promoter, 
partial Ad5 TPL, wildtype Ad5 fiber gene, and bovine growth hormone terminator 
was transferred from pCLF, prepared as described in Example 1 and also as 
described by Von Seggern et al., J. Gen Virol., 79\ 1461 (1998), to a zeocin 
selectable cloning vector referred to as pCDNA3.1/Zeo (+) (commercially available 
from Invitrogen and the sequence is also available). 
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In an analogous process, pDV67 containing connplete TPL was constructed by 
transferring an Asp 718/Xbal fragment from pDV60 to the pcDNA3.1/Zeo(+) 
backbone. The nucleotide sequence of pDV67 is listed in SEQ ID NO: 44. 

D. pDV69 

To prepare pDV69 containing a modified fiber protein, the chimeric Ad3/Ad5 
fiber gene was amplified from pGEM5TS3H (Stevenson et aL, J, Virol, 69: 
2850-2857, 1995)) using the primers 5'ATGGGAT 

CAAGATGAAGCGCGCAAGACCG3' (SEQ ID NO: 45) and 
5'CACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID NO: 46), and cloned to 
the BamHl and NotI sites of pcDNA3.1/Zeo(+) via novel BamHI and NotI sites 
engineered into the primers to create pDV68. Finally, the complete TPL fragment 
described above was then added to the unique BamHI site of pDV68 to create 
pDV69. The nucleotide sequence of pDV69 is listed in SEQ ID NO: 47. 

E. Preparation of Stable Adenovirus Packaging Cell Lines 

El -2a S8 cells are derivatives of the A549 lung carcinoma line (ATCC # COL 
185) with chromosomal insertions of the piasmids pGRE5-2.E1 (also referred to as 
GRE5-E1-SV40-Hygro construct and listed in SEQ ID NO: 48) and pMNeoE2a-3.1 
(also referred to as MMTV-E2a-SV40-Neo construct and listed in SEQ ID NO: 49), 
which provide complementation of the adenoviral E1 and E2a functions, 
respectively. This line and its derivatives were grown in Richter's modified medium 
(BioWhitaker) + 10% PCS. El -2a S8 cells were electroporated as previously 
described (Von Seggern et aL, J. Gen Virol., 79: 1461 (1998)) with pDV61, pDV67. 
or with pDV69, and stable lines were selected with zeocin (600 pg/ml). The cell line 
generated with pDV61 is designated 601. The cell line generated with pDV67 is 
designated 633 while that generated with pDV69 is designated 644. Candidate 
clones were evaluated by immunofluorescent staining with a polyclonal antibody 
raised against the Ad2 fiber. Lines expressing the highest level of fiber protein were 
further characterized. 

For the S8 cell complementing cell lines, to induce El expression, 0.3 pM of 
dexamethasone was added to cell cultures 16-24 hours prior to challenge with virus 
for optimal growth kinetics. For preparing viral plaques, 5X10^ cells/well in 6 well 
plates are prepared and pre-induced with the same concentration of dexamethasone 
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the day prior to infection with 0.5 pM included at a final concentration in the agar 
overlay after infection. 

F. Development of Cell Lines for Complementation of E17E2a* Vectors 
This example shows the construction of S.8 cells 

The Adenovirus 5 genome was digested with Seal enzyme, separated on an 
agarose gel, and the 6,095 bp fragment comprising the left end of the virus genome 
was isolated. The complete Adenovirus 5 genome is registered as Genbank 
accession #M73260, incorporated herein by reference, and the virus is available 
from the American Type Culture Collection, Manassas, Virginia, U.S.A., under 
accession number VR-5. The Seal 6,095 bp fragment was digested further with Clal 
at bp 917 and Bglll at bp 3,328. The resulting 2,41 1 bp Clal to Bglll fragment was 
purified from an agarose gel and ligated into the superlinker shuttle plasmid pSE280 
(Invitrogen, San Diego, CA), which was digested with Clal and Bglll, to form 
pSE280-E. (Figure 23). 

Polymerase chain reaction (PCR) was performed to synthesize DNA encoding 
an Xhol and Sail restriction site contiguous with Adenovirus 5 DNA bp 552 through 
924. The primers which were employed were as follows: 
5' end, Ad5 bp 552-585: 

5'-GTCACTCGAGGACTCGGTC-GACTGAAAATGAGACATATTATCTGCCACGGAC 
C-3' (SEQ ID NO: 66) 
3' end, Ad5 bp 922-891: 

5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' (SEQ ID NO: 67) 
This amplified DNA fragment (sometimes hereinafter referred to as Fragment 

A) then was digested with Xhol and Clal, which cleaves at the native Clal site (bp 

917), and ligated to the Xhol and Clal sites of pSE280-E, thus reconstituting the 5( 

end of the El region beginning 8 bp upstream of the ATG codon. 

PCR then was performed to amplify Adenovirus 5 DNA from bp 3.323 through 

4,090 contiguous with an EcoRI restriction site. The primers which were employed 

were as follows: 

5* end, Ad5 bp 3323-3360: 

5'-CATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID NO: 68) 
3* end, Ad5 bp 4090-4060: 
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5'-GCGACTTAAGCAGTCAGCTG-AGACAGCAAGACACTTGCTTGATCCAAATCC-3 
' (SEQ ID NO: 69) 

This amplified DNA fragment (sometimes hereinafter referred to as Fragment 
B) was digested with Bglll, thereby cutting at the Adenovirus 5 Bglll site (bp 3.382) 
and EcoRI. and ligated to the Bglll and EcoRI sites of pSE280-AE to reconstruct the 
complete E1a and Elb region from Adenovirus 5 bp 552 through 4,090. The 
resulting plasmid is referred to as pSE280-E1 (Figure 23), 

A construct containing the intact Ela/b region under the control of the synthetic 
promoter GRE5 was prepared as follows. The intact Ela/b region was excised from 
pSE280-E1, which was modified previously to contain a BamHI site 3' to the El 
gene, by digesting with Xhol and BamHI. The Xhol to BamHI fragment containing the 
Ela/b fragment was cloned into the unique Xhol and BamHI sites of pGRE5-2/EBV 
(Figure 4. U.S. Biochemicals. Cleveland, Ohio) to form pGRE5-E1 (Figure 24). 

Bacterial transformants containing the final construct were identified. Plasmid 
DNA was prepared and purified by banding in CsTFA prior to use for transfection of 
cells. 

Construction of plasmid including Adenovirus 5 E2A sequence. 

The Adenovirus 5 genome was digested with BamHI and Spel, which cut at bp 
21,562 and 27,080. respectively. Fragments were separated on an agarose gel and 
the 5,518 bp BamHI to Spel fragment was isolated. The 5,518 bp BamHI to Spel 
fragment was digested further with Smal, which cuts at bp 23.912. The resulting 
2,350 bp BamHI to Smal fragment was purified from an agarose gel. and ligated into 
the superlinker shuttle plasmid pSE280, and digested with BamHI and Smal to form 
pSE280-E2 BamHI-Smal (Figure 26). 

PGR then was performed to ainplify Adenovirus 5 DNA from the Smal site at bp 
23.912 through 24.730 contiguous with Nhel and EcoRI restriction sites. The 
primers which were employed were as follows: 
5* end. Ad5 bp 24,732-24.708: 

5'-CACGAATTCGTCAGCGGTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID NO: 70) 
3' end, Ad5 bp 23,912-23,934: 

5'-CACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3' (SEQ ID NO: 71) 

This amplified DNA fragment was digested with Smal and EcoRI. and ligated to 
the Smal and EcoRI sites of pSE280-E2 Bam-Sma to reconstruct the complete E2a 
region from Ad5 bp 24,730 through 21 ,562. The resulting construct is pSE280-E2a. 
(Figure 27.) 
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In order to convert the BamHI site at the 3' end of E2a to a Sail site, the E2a 
region was excised from pSE280-E2a by cutting with BamHI and Nhel, and recloned 
into the unique BamHI and Nhel sites of pSE280. (Figure 6.) Subsequently, the 
E2a region was excised from this construction with Nhel and Sal! in order to clone 
into the Nhel and Sail sites of the pMAMneo (Clonetech, Palo Alto, CA) multiple 
cloning site in a 5' to 3' orientation, respectively. The resulting construct is 
pMAMneo E2a. (Figure 27). 

Bacterial transformants containing the final pMAMneo-E2a were identified. 
Plasmid DNA was prepared and purified by banding in CsTFA. Circular plasmid 
DNA was linearized at the XmnI site within the ampicillin resistance gene of 
pMAMneo-E2a, and further purified by the phenol/chloroform extraction and ethanol 
precipitation prior to use for transfection of cells. 
Transfection and selection of cells. 

In general, this process involved the sequential introduction, by calcium 
phosphate precipitation, or other means of DNA delivery, of two plasmid 
constructions each with a different viral gene, into a single tissue culture cell. The 
cells were transfected with a first construct and selected for expression of the 
associated drug resistance gene to establish stable integrants. Individual cell clones 
were established and assayed for function of the introduced viral gene. Appropriate 
candidate clones then were transfected with a second construct including a second 
viral gene and a second selectable marker. Transfected cells then were selected to 
establish stable integrants of the second construct, and cell clones were established. 
Cell clones were assayed for functional expression of both viral genes. 

In order to determine the most suitable cell lines for the above-mentioned 
transfections, sequential transfections and selections were carried out with the 
following parental cell types: 

A549 (ATCC Accession No. CCL-1 85); 

Hep-2 (ATCC Accession No. CCL-23); or 

KB (ATCC Accession No. CCL-17). 

Appropriate selection conditions were established for both G418 and 
hygromycin B for all three cell lines by standard kill curve detennination. 
Transfection of cell lines with plasmids including El and E2a regions. 

pMAMNeo-E2a was linearized with XmnI with the Amp" gene, introduced into 
cells by transfection, and cells were selected for stable integration of this plasmid by 
G418 selection until drug resistant colonies arose. The clones were isolated and 
screened for E2a expression by staining for E2a protein with a polyclonal antiserum, 





wo 00/42208 



PCT/EPOO/00265 



-81 - 



and visualizing by immunofluorescence. E2a function was screened by 
complementation of the temperature-sensitive mutant Ad5ts125 virus which contains 
a temperature-sensitive mutation in the E2a gene. (Van Der Vliet. et aL, J. Virology, 
Vol. 15, pgs. 348-354 (1975)). Positive clones expressing the E2a gene were 
identified and used for transfection with the 7 kb EcoRV to XmnI fragment from 
pGRE5-E1 (Figure 5), which contains the GRE5 promoted E1a/b region plus the 
hygromycin" gene. Cells were selected for hygromycin resistance and assayed for 
Ela/b expression by staining with a monoclonal antibody for the El protein 
(Oncogene Sciences, Uniondale, N.Y,). El function was assayed by ability to 
complement an E1 -deleted vector. At this point, expression and function of E2a was 
verified as described above, thus establishing the expression of both Ela/b and E2a 
in the positive cell clones. 

One of the transfected A549 cell lines showed good Ela/b and E2a expression 
and was selected for further characterization. It was designated the S8 cell line. 

G. Preparation of Adenoviral Vectors Containing Ad5.pgal.AF Genome in 



To prepare adenoviral vectors containing Ad5.pgal.AF in SB cells containing 
alternative forms of TPL for enhancing the expression of fiber proteins, the protocol 
as described in Example 2 for preparing Ad5.pgal.AF in 21 1 B cells was followed 
with the exception of pretreatment with 0.3 pM dexamethasone for 24 hours as 
described above. Thus, viral particles with the wildtype Ad5 fiber protein on their 
surface and containing the fiberless Ad5.pgal.AF genome were produced in 633 
cells. Particles produced in 644 cells also contained the fiberless Ad5.pgal.AF 
genome, but had the chimeric 5T3H fiber protein, with the Ad3 fiber knob, on their 
surface. 

The preparation of the cell lines and demonstration of stable nuclear 
expression of either wild-type Ad5 fiber protein or chimeric Ad5/Ad3 protein is shown 
in Figure 20. In the figure, schematic diagrams are presented of the constructs used 
to generate the cell lines as well as immunofluorescence results indicating the 
presence of expressed fiber protein in the nucleus of the cells. An indirect 
immunofluorescence assay of A549 based cell lines which stably express the 
different Ad fibers is shown. Line 633 expresses the native Ad5 fiber protein and line 
644 expresses a chimeric fiber protein with the tail and shaft domains of the Ad5 
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protein and the knob domain of the Ad3 fiber. Previous work (Stevenson et al., 1996) 
showed that a virus containing this protein had the tropism expected for Ad3. 

Thus, these viral preparations, prepared as described herein and in Example 2, 
are useful for targeting delivery of Ad5.pgal.AF fiberless genome with either wild- 
type or modified fibers, embodiments of which uses have been previously discussed 
and as further exemplified with the pseudotyping and infectivity results described in 
Example 7. 

Example 7 

Pseudotyping and Infectivity of Recombinant Adenoviral Vectors 
Produced with Improved Fiber-Complementing Cell Lines 

A. Pseudotyping of AdS.pgal.AF 

To verify that adenoviral vectors were produced had altered tropisms, viral 
particles were purified from either 633 (expressing wilt type Ad5 fiber) or 644 cells 
(expressing the chimeric Ad5/Ad3 fiber) lO^ig of the purified particles were Western 
blotted and probed with a polyclonal rabbit antibody against the Ad2 fiber (which 
detects both the Ad5 and chimeric 5T3H fiber proteins.). Equal amounts of purified 
Ad.pgal.wt or Av9LacZ ( which has the chimeric fiber gene in the viral chromosome) 
were run as controls. The results are shown in Figure 21 where both fiber proteins 
were detectable confirming pseudotyping. 

B. Infectivity of Cells with 633 or 644 Generated Virus Particles 

The cell lines, 633 or 644, prepared as described above, were infected with the 
indicated number of particles/cell of AdS.pgal.AF and virus particles produced. 
Virus was then used to infect, as previously described, selected cell lines as shown 
in Figure 22, including 211B, MRC-5 human fibroblasts. A-10 rat aortic endothelial 
cells, and THP-1 human monocytic cells. Unbound virus was removed by washing 
the cells and the cells were further incubated at 37°C for 48 hours. Cells were then 
fixed with glutaraldehyde and stained with X-gal. The percentage of stained cells 
was then determined by light microscopy where all experiments were done in 
triplicate. 

The results shown In Figure 22 indicate that adenoviral vectors could be 
retargeted by pseudotyping using packaging cell lines expressing different fiber 
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proteins. The data marked with "none" indicates virus grown in 293 cells and lacking 
fiber, while "Ad5" indicates virus prepared in 633 cells (containing the wild type fiber) 
and AdS indicates virus prepared in 644 cells (containing the chimeric 5T3H fiber.) 
Particles containing either fiber were equally infectious on 21 1 B cells, while MRC-5 
fibroblasts and THP-1 cells were more readily infected by virus containing the 
chimeric fiber. The A-10 rat endothelial cells were more readily infected by particles 
containing the wlldtype AdS fiber protein. 

Example 8 

Targeted Gene Delivery Using Viral Vector 
Particles Lacking Fiber Protein 

An alternative mode of entry for adenoviral infection of hematopoietic cells 
has been described by Huang, et al., J. Virol.. 69:2257-2263 (1995) which does not 
involve the fiber protein-host cell receptor interaction. As infection of most other cell 
types does require the presence of fiber protein, vector particles which lack fiber 
may preferentially infect hematopoietic cells, such as monocytes or macrophages. 

To produce a fiber-free adenovirus vector particle, a vector lacking the fiber 
gene as described above in Example 2A but containing a gene of interest for 
delivery is amplified by growth in cells which do not produce a fiber protein, such as 
the 21 1 cells prepared in Example 1 or 293 or S8 cells as described herein, thereby 
producing large numbers of particles lacking fiber protein. The recovered fiber-free 
viral particles are then used to deliver the inserted gene of interest following the 
methods of this invention via targeting mechanisms provided by other regions of the 
adenoviral vector, i.e., via the native penton base. 

A. Construction of an Adenovirus Vector Deleted for El, E3, 
and Fiber, and Carrying a Therapeutic Gene of Interest 

A general method of constructing a fiber-deleted Ad vector containing a 
therapeutic gene of interest (in this example, the Herpes Simplex Virus Thymidine 
Kinase (TK) gene) is described here. Linear viral DNA is isolated from a 
preparation of AdS.pgal.AF particles. This DNA is digested with the restriction 
enzyme Clal, which removes the leftmost viral sequences including the left ITR, the 
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packaging signals, and part of the S\/40-driven p-galactosidase gene. The large 
Clal fragment with the rennainder of the fiber-deleted viral genome is then isolated by 
centrifugation on a sodium chloride or sucrose gradient. The plasmid pAdShuttleTK, 
which contains the left part of the Ad chromosome with an RSV-driven TK gene 
inserted in place of the E1 region, is linearized by digestion with Notl. The 
nucleotide sequence of the pAdShuttleTK is shown in SEQ ID NO: 50. The large 
Clal fragment of Ad5.pgal.AF and the linearized pAdShuttleTK are cotransfected 
into 21 IB cells, and an infectious adenovirus genome is generated by homologous 
recombination. A virus deleted for E1 , E3, and fiber that contains the TK cassette in 
the place of the El deletion is thus recovered. A vims containing any desired 
therapeutic gene of interest can be created in this manner by replacing the TK gene 
of the example with the gene of interest. 

An altemative method of constructing a fiber-deleted genome containing a 
therapeutic gene (in this example the retinal degeneration-slow (RDS) gene driven 
by the CMV immediate early promoter) is described here. RDS is a protein 
expressed in photoreceptors, and essential for their proper development and 
functioning. RDS mutations have been implicated in retinal degenerative disorders, 
and transfer of the wildtype RDS gene by means of an Ad vector provides an avenue 
towards treating such disorders. 

A plasmid (pDV50) analogous to pAEIBpgal but containing a CMV-driven 
RDS gene was constructed as follows. First, a fragment containing the CMV 
promoter and enhancer was excised from pCHaMlEP by digestion with Hindlll, filling 
the overhanging ends with the large fragment of E. coli DNA polymerase 1 , ligation 
of BamHI linkers (5'CGCGGATCCCG3' SEQ ID NO: 51) to the blunt ends, and 
digesting with BamHI. The resulting fragment was then ligated into the BamHI site 
of pAEIspla (Mikrobix) to create pDV45. A fragment containing the SV40 
polyadenylation signal was amplified from pSVpgal (Promega) using the 
oligonucleotides 5'CTGACAAACTCAGATCTTGTTTATTG3' (SEQ ID NO: 51) and 
5'GTCGACTCTAGAGGATCCAGA3' (SEQ ID NO: 52). This fragment was ligated 
into the Bglll site of pDV45 to create pDV46, using the unique BamHI and Bglll sites 
(bold type) in the primers. Finally, the human RDS open reading frame was 
amplified from the plasmid pRDS-T7 using the oligonucleotides 5'CCGGACTCT 
AGATGGCAACCATGGCGCTAC3' (SEQ ID NO: 53) and 5'GGA 
GGGGAAGCTTGGCCCTCAGCCAGCCTCT3' (SEQ ID NO: 54). This fragment 
was inserted into the Hindlll and Xbal sites of pDV46, again using unique restriction 
sites in the primers, to create pDV50. pDV50 therefore contains a cassette 
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consisting of the CMV promoter, the RDS open reading frame, and the SV40 
temninator sequences inserted in place of the Ad5 E1 region. 

In a manner analogous to the construction of Ad5.pgal.AF , pDV50 and 
pDV44 are then co-transfected into 211 B cells, and an infectious Ad genome 
(Ad5.RDS.AF) is recovered. A fiber-deleted Ad vector containing any desired gene 
to be expressed can be constructed by replacing the RDS gene of this example with 
the gene of interest. 

Example 9 

Transient Transcomplementation 

Human adenovirus type 5 (Ad5) is being developed as a vector for gene 
therapy. Its ability to deliver therapeutic genes to cells is mediated by the interaction 
of the adenoviral fiber protein with the coxsackievims-adenoviral receptor (CAR). 
Because a wide-range of cells express CAR, it can be difficult to use adenoviruses 
to deliver genes to specific cell types. One way to address this is to target the virus 
to a particular cell type by genetically altering the fiber. However, the genetic 
manipulations involved in cloning and production of the viruses with altered fibers 
can be time-consuming. Thus it would be a significant advancement in the field of 
adenoviral gene therapy to have a more streamlined system for testing modified fiber 
genes. An in vitro system has thus been developed that involves infection of tissue 
culture cells with a fiber-deleted Ad and transient co-transfection with a plasmid 
directing fiber expression. This system allows one to produce and evaluate such 
modified fibers in the context of a viral particle easily and quickly. In addition this 
system can be envisioned to actually produce therapeutic quantities of adenoviral 
vectors with modified fiber proteins, with such fibers having a new tropism added by 
insertion of a desired ligand into the fiber gene. These fibers may also have the 
natural tropism {i.e. binding to CAR) ablated. 

Plasmids used were pDV60 and pDV55, prepared as described herein. 
pDV60 is an pcDNA3,1 -based expression plasmid that contains the CMV promoter, 
Ad5 tripartite leader, an intron, and the Ad5 fiber gene sequence. pDV55 contains 
no fiber gene and serves as the negative control. Ad5.pgal.AF and 21 1 B are 
described above. 293T cells are identical to 293 cells except they express an 
integrated SV40 large T antigen gene. HDF cells are human diploid fibroblasts. 
293T cells express CAR and a, integrins; HDF cells express otv integrins but no 
CAR. Transfections with fiber expression plasmids were performed with 
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Lipofectamine (GIBCO-BRL) using 20mg DNA and 50ml Lipofectamine per 15cm 
dish. Cells were maintained in DMEM supplemented with 10% fetal bovine serum. 

The fiber deletion mutation of Ad5.pgal.AF is complemented in trans by 
passaging virions through 21 1B, a cell line that stably expresses functional Ad5 
fiber. The present system was designed to complement Ad5.pgal.AF by modified 
fibers expressed from transfected episomal plasmids in 293T cells. The result is a 
simplified and rapid method to incorporate modified fibers on a viral particle 
containing the Ad5.pgal.AF genome that does not require propagation of the virus. 

The feasibility of transcomplementation of Ad5.pgal.AF with episomal 
fjber-expressing plasmids was demonstrated in the following experiment. 293T cells 
were transfected with one of two plasmids: pDV55, which expresses no fiber or 
pDVeo, which expresses wildtype Ad5 fiber. Fiber expression persists for at least 
six days, suggesting that the plasmid is stable as an episome for this amount of time 
Twenty-four hours after transfection, these cells were infected at 2000 particles/cell 
with Ad5.pgal.AF passaged through 21 IB cells. Seventy-two hours later, a crude 
viral lysate (CVL) was generated by exposing the cells to five freeze-thaw cycles. 
Viral particles were purified by cesium chloride gradient centrifugation. The resulting 
virions incorporated the fiber expressed from the episomal plasmid, as confirmed by 
Westem blots performed with an antibody specific to the Ad5 fiber. 

To demonstrate the functionality of these virions, the transduction efficiency 
was tested. The virions containing no fiber (pDV55) or wildtype fiber (pDV60) were 
applied to monolayers of 293T and HDF cells at different multiplicity of infection 
(Mors). 293T cells express CAR and a Ov'integrins; HDF cells express Ov integrins 
but no CAR. After 2 days, the cells were fixed and stained with X-gal to detect the 
pgalactosidase reporter gene activity. The results showed low transduction 
efficiency for the pDV55-complemented virions in both cell lines. As expected, the 
pDV60-complemented virions transduced 293T cells to a high degree but did not 
transduce HDF cells, indicating that functional fiber proteins had been expressed 
from the episomal plasmids and incorporated into the virions. This transduction 
efficiency was comparable to or better than that of Ad5.pgal.AF virions passaged 
through the 21 IB cells. 

Episomal plasmid transcomplementation system is suitable for quickly 
expressing and evaluating the properties of modified fibers in the context of a viral 
particle. Episomal plasmid transcomplementation will also be of great utility for 
quickly evaluating a bank of modified fibers for other binding properties, including 
novel tropism and the ablation of the native tropism. In addition to the rapid 
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generation and testing of large numbers of modified fibers, there are other 
advantages to the Ad5.pgal.AF transcomplementation system in terms of production 
and safety. Episomal plasmid transcomplementation has the inherent advantage 
over transcomplementation in that it is not necessary to make a stable cell line for 
every modified fiber with which you want to complement Ad5,pgal.AF . Because the 
Ad5.pgal.AF is deleted in El, E3 and fiber, there is an additional gene deletion 
compared to other first generation vectors. This makes Ad5.pgal.AF more 
replication defective and presumably safer. In addition, the presence of the fiber 
gene deletion decreases the opportunity to generate replication-competent virus via 
recombination in the packaging cells. In temis of production a single Ad vector prep 
could be retargeted to any number of different cell types simply by transfecting the 
cells with the appropriate fiber-expression construct. 



Adenovirus type 37 (subgroup D) has been associated with infections of the 

eye 

and genital tract, and may be useful for targeting these tissues or other mucous 
membranes, as well as other cell types. The tropism of Ad37 is due to the binding 
preference of its fiber protein, which binds to an as yet-unidentified receptor located 
on the surface of cells including Chang C, conjunctival epithelial cell line (Huang et 
ai, J. Virology 73(4):2798-2802 (1999)). As this fiber directs viral infection to cell 
types different than those infected by AdS. it is likely to provide a method for 
targeting gene delivery. This example describes construction of packaging cell lines 
expressing the Ad37 fiber protein, and their use in generating particles of a fiber- 
deleted Ad vector (such as Ad5.pgaLAF ) containing this fiber protein. The fiber 
protein is attached to the viral capsid by binding to the penton base protein through 
its N-terminus, and the Ad37 fiber was modified in order to make its N-terminal 
sequence more closely match that of the AdS protein to ensure that it would 
efficiently bind the AdS penton base in these vectors. 



Example 10 



Adenoviral Gene Delivery Vectors Containing the 



Ada? Fiber Protein 



1. Construction of an Expression Plasmid for the Ad37 Fiber Protein 
(pDVSO) 
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This plasmid uses the same regulatory elements as contained in pDV60, 
pDV67, and pDV69 to express the Ad37 fiber in packaging lines, and was 
constructed in two steps. First, the Ad37 fiber open reading frame was amplified 
from Ad37 genomic DNA (obtained from the ATCC - accession number VR- 
929)using the synthetic oligonucleotides primers L37 (5' TGT CTT GGA TCC AAG 
ATG AAG CGC fiCC CGC CCC AGC GAA GAT GAC TTC 3') (SEQ ID NO: 56) and 
37FR (5' AAA CAC GGC GGC CGC TOT TTC ATT CTT G 3') (SEQ ID NO: 57). L37 
contains nucleotides that differ from the Ad37 genomic sequence in order to add an 
unique Bam HI site (bold in the above sequence) and create point mutations to 
make the N-terminal sequence of the fiber more closely match that of the AdB 
protein (underiined in the above sequence; the start codon is italicized). 37FR 
incorporates changes to create a unique Not 1 site (bold). The PGR product was 
Inserted into the Bam H1 and Not 1 sites of pCDNA3.1zeo{+) (Invitrogen) to create 
pDV78. The correct sequence of the Ad37 fiber gene, including the predicted 
changes, was confirmed by sequencing. 

Second, a 1.2 kb Bam HMBgl II fragment containing an adenovirus type 5 
tripartite leader was excised from pDV55 (DVS 1999) and inserted into the Bam H1 
site of pDV78 to create pDVBO (SEQ ID NO:64) 

2. Isolation of CeU Lines Expressing the Ad37 Fiber Protein 

pDV80 DNA was purified using the Qiagen method and electroporated into 
the adenovirus-complementing cell line El -2a SB (Gorziglia et ai, J. Virology 
70(5)'AMZ-AMQ (1996)) as previously described (Von Seggern, et ai. J. Gen. Virol. 
79:1461-1418), and stable clones were selected with 600 pg/ml zeocin (Invitrogen). 
Clones were expanded and screened for fiber expression by indirect 
immunofluorescence using a rabbit polyclonal antibody directed against the Ad37 
fiber. Two clones (lines 705 and 731) that expressed the protein at a unifonnly high 
level were selected for further study. 

3. Production of Pseudotyped Ad Vector Particles 

To generate vector particles equipped ('pseudotyped') with the Ad37 fiber 
protein, the Ad37 fiber-expressing 705 cells were infected (approximately 1 000 
particles/cell) with Ad5.pgal.AF or with Ad5.GFP.AF. 
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Ad5.pgal.AF is prepared as previously described. AdS.GFP.AF was 
constructed by recombination in bacteria using a modification of the method of (He, 
ef a/., PNAS 95:2509-2514 (1998)). First, a fiber-deleted genomic plasmid was 
constructed by removing the fiber gene from pAdEasyl (He, ef al,, PNAS 95:2509- 
2514 (1998)). pDV43 (Von Seggern, et al., J. ViroL 73:1601-1608 (1999)) was 
digested with Pac 1 . the ends blunted by treatment with the large fragment of E. coli 
DNA polymerase and dNTPs, and the product re-ligated. The resulting plasmid. 
pDV76, is identical to pDV43 except for loss of the Pac 1 site and contains the right 
end of the Ad5 genome with E3 and fiber deletions. A 4.2 kb fragment was amplified 
from pDV76 using the oligonucleotides primers 5' CGC GCT GAG TCT TA GGA 
CTA GTT TC 3' (SEQ ID NO: 58) (including the unique Spe 1 site in the Ad5 
genome, bold) and 5' GOG GTT AAT TAA CAT CAT CAA TAA TAT ACC TTA TTT T 
3' (SEQ ID NO: 59) (including a novel Pac 1 site (bold) adjacent to the right Ad5 
ITR). This PCR fragment therefore contains nucleotides 27,082 to 35,935 of the Ad5 
genome with a deletion of nucleotides 28133 to 32743 (the E3 and fiber genes), and 
was used to replace the corresponding Spe MPac 1 fragment of PAdEasyl to create 
pDV77. 

E. CO// strain BJ5183 was electroporated with a mixture of pDV77 and Pme 1- 
linearized pAdTrack as described (He et al., 1998), and DNA was isolated from 
kanamycin-resistant colonies. The resulting plasmid, pDV83, contains a complete 
E1-, E3-, and fiber-deleted Ad5 genome with a CMV-driven GFP reporter gene 
inserted at the site of the El deletion. The full-length Ad chromosome was isolated 
by Pac 1 digestion, and transfected to the El- and fiber-complementing 633 cells 
(Von Seggern et aL, J, Virol January 2000). The recovered virus was then plaque 
purified by plating on 633 cells and stocks were prepared. 

Ad5-pseudotyped particles were generated by virus growth In 633 cells, which 
express the wild type Ad5 fiber protein. Viral particles were isolated and purified 
over CsCI gradients as previously described (Von Seggern et al., J. ViroL 73:1601- 
1608, 1999). For analysis of viral proteins, ten pg of the purified particles were 
electrophoresed on 8-16% gradient gels and the protein transferred to nylon 
membranes. The blot was then probed with rabbit polyclonal antibodies raised 
against recombinant Ad37 fiber or Ad5 fiber or penton base proteins expressed in 
baculovirus-infected cells (Figure 27). 



Example 11 
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Construction of a Fiber Expression Construct Containing a Post- 
Transcriptional Regulatory Element 

Previous studies have shown that mRNA transcribed from the woodchuck 
hepatitis virus (WHV) genome contains an element (the WHV post-transcriptional 
regulatory element, or WPRE) which can increase expression of a protein encoded 
by the mRNA via a post-transcriptional mechanism (Loeb et al., Human Gene 
Therapy y£?:2295-2305 (1999)). The WPRE has also been shown to enhance 
expression of transgenes delivered by retroviral vectors. (Zufferey, R. et al., J. Virol. 
73:2886-2892 (1999)). This example describes the construction of a fiber 
expression construct (pDV90) containing a WPRE as well as the promoter and TPL 
sequences as contained in pDV67. 

A piasmid (pBS/WPRE) which contains the WPRE was obtained from Dr. 
Thomas Hope, Salk Institute. Digestion of pBS/WPRE with C/al releases a 600 bp 
fragment containing the WPRE (nt 193-1684 of the WHV genome.) Following C/al 
digestion, the ends of this fragment were filled by treatment with the large fragment 
of E. coli DNA polymerase 1 in the presence of dNTPs to render them blunt. pDV67 
DNA was digested with Xba^ (which cuts at a unique site in the transcribed region 
downstream of the Ad5 fiber open reading frame) and the ends filled by the same 
treatment. The filled WPRE fragment was then ligated into the filled Xba 1 site of 
pDV67 to create pDV90 (SEQ ID NO: 65). The sequence is found at GenBank 
accession no. J04514 (entire genome) in Zufferey, R. et al., J. Virol. 73:2886-2892 
(1999). - 

pDV90 was electroporated into El -2a SB cells and stable clones expressing 
fiber isolated as described previously for pDV80. 

Example 12 

Construction ofanAdS Fiber Protein with Heterologous 
Peptide Sequences Inserted in the HI Loop 

The receptor-binding knob domain of the Ad5 fiber protein contains several 
surface loops which are attractive candidates for the insertion of heterologous 
peptide sequence, as an additional ligand for vector targeting. This example 
describes the construction of a fiber gene which encodes a fiber protein containing a 
6 amino acid peptide linker in the HI loop, and retains the ability to trimerize. The 
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modified gene also contains a unique novel restriction site at the position of the 
linker insertion to facilitate addition of the targeting ligand into the Hi loop. 

The Ad5 fiber gene was amplified from Ad5 genomic DNA (ATCC accession 
number VR-5) using the primers Fiber ATG (5' TGA AGC GCG CAA GAC CGT CTG 
AAG 3') (SEQ ID NO: 60) and Fiber TAA (5' CAT AAC ACT GCA GAT TCT TTA 
TTC TTG G 3") (SEQ ID NO: 61), and cloned to the A/c/e1 (filled with the large 
fragment of E. coli DNA polymerase 1 in the presence of dNTPs) and Pst 1 sites of 
pT7-7 using a unique Pst 1 site (bold) in the 'Fiber TAA' oligo. The resulting 
plasmid, pT7/fiber, was digested with Xba 1 and Pst 1 to excise the fiber gene, which 
was then cloned into the Pst 1 and Xba 1 sites of pUC1 19 to create pUC/fiber. This 
pUC-derived plasmid contains an origin for single-stranded DNA replication and can 
therefore be used to create template DNA for site-directed mutagenesis. 

Site-directed mutagenesis was carried out according to the method of Kunkel 
(T.A. Kunkel, PNAS 52:488-492 (1985)) using the oligonucleotide primer T542 (5' 
GGT ACA CAG GAA ACA GGA GGT TCC GGA GGT GGA GGA GAC ACA ACT CC 
3") (SEQ ID NO: 62). This results in the addition of 18 new bases (underlined) 
encoding the sequence Gly Gly Ser Gly Gly Gly (SEQ ID NO: 63). with a novel 
BspEI site (bold) for the addition of further sequences. The inserted sequence is 
between Thr542 and Gly543 of the Ad5 fiber protein, in the HI loop. The modified 
plasmid is termed pDV14. 

Finally, the modified fiber gene was excised from pDV14 by digestion with Pst 
1 and Xba 1 and cloned into the Pst 1 and Xba 1 sites of pGEM3Z (Promega) to 
create pDV18. In vitro transcription/translation experiments with pDV18 (using the 
TNT™ kit, Promega) demonstrated that the modified fiber gene encoded a protein 
which was capable of Irimerlzing. 

Alternatively an Ad5 fiber open reading frame (ORF) is amplified from Ad5 
genomic DNA (wildtype Ad5 was purchased from the ATCC) using the 
oligonucleotides 5' ATG GGA TCC AAG ATG AAG CGC GCA AGA CCG 3' (SEQ ID 
NO: 72) and 5' CAT AAC CTG CAG GAT TCT TTA TTC TTG GGC 3' (SEQ ID NO:,73) 
and inserted into the BamH^ and Pst 1 sites of pGEM-3Zf(+) (Promega Inc.. 
Madison. Wl) via novel restriction sites (bold type) designed into the primers. The 5' 
oligonucleotide also contains a G to A change 3 nucleotides 5' of the initial ATG 
codon (underiined), designed to improve the consensus for translation initiation. 

Site-directed mutagenesis is performed by the method of Kunkel (Proc. Nat. 
Acad. Sci. 82:488-492 (1985)). using the synthetic oligonucleotide 5' GGT ACA CAG 
GAA ACA GGA GGT TCC GGA GGT GGA GGA GAC ACA ACT CC 3' ((SEQ ID 
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NO: 74). This operation introduced sequence (bold type) encoding 6 novel amino 

acids (Gly Gly Ser Gly Gly Gly) immediately following Threonine 542 of the Ad5 
fiber, and including a unique restriction site for the insertion of further heterologous 
sequences (underlined). The resulting plasmid (pDV18A) contains the modified 
fiber gene under the control of the T7 promoter in the parental pGEM-3Zf(+) and 
can be used for in wYro transcription/translation reactions to produce labeled fiber 



Adenoviral vectors which lack essentially all Ad genes ('helper-dependenf or 
'gutless' vectors) have recently been developed. In a modification of this idea, 
vectors ('hybrid' vectors) which contain an adeno-associated virus (AAV) or retroviral 
genome have been generated. As AAV and retroviral genomes integrate into the 
chromosome of the target cells, the hybrid Ad/AAV or Ad/retroviral vectors have the 
potential to provide very long-term gene expression. 

Lieber et aL, {J. Virol. 73(1 1):931 4-9324) describe an Ad vector (Ad.AAVI) 
which contains an AAV vector genome (a transgene insert flanked by the AAV 
inverted terminal repeats) inserted into the E1 region. When 293 cells are infected 
by Ad.AAVI . recombination between the AAV sequences generates a minimal Ad 
chromosome which carries the Ad inverted terminal repeats and packaging signal 
flanking the AAV vector genome. This chromosome cannot direct the synthesis of 
Ad proteins, but can be packaged into Ad vector particles. The remaining 
unrecombined Ad chromosomes provide the Ad structural proteins in trans, and both 
the full-length and minimal genomes are packaged into particles. The particles 
carrying the minimal Ad/AAV hybrid vector are then isolated by CsCI centrifugation. 

These particles have the capsid structure of adenovirus, and infect cells using 
the efficient fiber- and penton base-mediated pathway used by Ad. Following 
infection, the hybrid genome is able to integrate into the cell's chromosomes by 
virtue of its AAV sequences. In this example, the AAV vector genome is inserted 
into the El region of a fiber-deleted vector, and the resulting vector is grown in 
packaging lines expressing either the Ad5 or Ad37 fiber proteins. The particles 
recovered therefore have the tropisms expected from the respective fiber proteins 
combined with the ability to integrate their AAV genome into target cells. Such 



protein. 
Example 13 



Use of the Fiber Expression System to Retarget 
('Pseudotype') Hybrid Ad/AAV Vectors 
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pseudotyping should be possible with any of a number of modified fiber proteins, as 
for the fiber-deleted vectors already described by us. 

The Ad vector is constructed in a manner analogous to that described for 
Ad5.pgal.AF , by recombination between pAd.AAVI (Lieber et ai J. ViroL 73.^93^4- 
9324, 1999) and pDV44 (as described earlier in the specification.) pAd.AAV1 
carries an MLV promoter-driven secreted alkaline phosphatase gene (SEAP) as a 
reporter, and an SV40-driven neomycin phosphotransferase (neo) gene to allow the 
selection of cells stable transduced by the AAV cassette. The resulting vector 
(Ad.AAVI.CSF) has the AAV vector cassette of Ad.AAV1 inserted into the El region 
of a genome with the fiber deletion of Ad5.pgal.AF . Growth of Ad.AAV1.AF in 633 
cells results in particles carrying the AAV genome and the AdS fiber, and which have 
the tropism associated with Ad5. Growth of Ad.AAV1.C3)F in 705 cells produces 
particles bearing the Ad37 fiber and therefore having its associated different tropism. 

Tropism is evaluated by infecting Chang G cells (which express the Ad37 
receptor) and A549 cells which do not express this protein but do express the AdS 
receptor (CAR). The extent of infection is monitored by assaying alkaline 
phosphatase expression, and the fraction of cells stable transduced is assayed by 
selection with neomycin. By using purified recombinant AdS or Ad37 fiber proteins 
as competitors during infection, the usage of the expected receptors by the 
pseudotyped particles is evaluated. 

Example 14 

Use of the Fiber Expression System to Retarget 
CPseudotype') Helper-dependent Ad Vectors 

Gutted Ad vectors are those from which most or all viral genes have been 
deleted. They are grown by co-infection of the producing ceils with a "helper" virus 
(such as using an El -deleted Ad vector). The helper virus frans-complements the 
missing Ad functions, including production of the viral structural proteins needed for 
particle assembly. In one embodiment of this invention, the helper virus Is a fiber- 
deleted Ad (such as that described in Von Seggern a/., J. ViroL 73:1601-1608 
(1999)). The vector is prepared in a fiber expressing cell line such as has been 
previously described by Von Seggern et al., J. Gen, ViroL 75:1461-1468 (1998), Von 
Seggern et aL, J, Virol, 74:354-362 (2000). All the necessary Ad proteins except 
fiber are provided by the fiber-deleted helper virus, and the particles are equipped 
with the particular fiber expressed by the host cells. A concern with gutted vectors 
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One way to do this Is to mutate the packaging sequence by deleting one or more of 
the nucleotides comprising the sequence or otherwise mutating the sequence to 
inactivate or hamper the packaging function. An alternative approach is to engineer 
the helper genome so that recombinase target sites flank the packaging sequence 
and to provide a recombinase in the packaging cell. The action of recombinase on 
such sites results in the removal of the packaging sequence from the helper virus 
genome. Preferably, the recombinase is provided by a nucleotide sequence in the 
packaging cell that encodes the recombinase. Most preferably, such sequence is 
stably integrated into the genome of the packaging cell. Various kinds of 
recombinase are known by those skilled In the art. The preferred recombinase is 
Cre recombinase, which operates on so-called lox sites, which are engineered on 
either side of the packaging sequence as discussed above* Further information 
about the use of Cre-loxP recombination Is found In U.S. Pat, No. 5,919,676 and 
Morsy and Caskey, Molecular Medicine Today, Jan. 1999, pgs. 18-24, both 
incorporated herein by reference. 

This example demonstrates how the fiber-expressing packaging lines can be 
used to generate pseudotyped particles of helper-dependent or 'gutless' vectors with 
altered tropisms. As the gutless vectors lack many or all Ad genes, they must be 
grown as mixed cultures In the presence of a helper virus which can provide the 
missing functions. To date, such helper viruses have provided all Ad functions 
except E1 , and El is complemented by growth in 293 cells or the equivalent. The 
resulting virus particles are harvested, and the helper virus is typically removed by 
CsCI gradient centrifugation (the vector chromosome is generally shorter than the 
helper chromosome, resulting in a difference in buoyant density between the two 
particles). 

An example of a gutless vector is pAdARSVDys (Haecker et ai. Human Gene 
Therapy 7:1907-1914 (1996)), This plasmid contains a full-length human dystrophin 
cDNA driven by the RSV promoter and flanked by Ad Inverted terminal repeats and 
packaging signals. 293 cells are infected with a first-generation Ad which series as 
a helper virus, and then transfected with purified pAdARSVDys DNA, Both the 
helper Ad genome and the pAdARSVDys DNA are replicated as Ad chromosomes, 
and packaged into particles using the viral proteins produced by the helper virus. 
Particles are isolated and the pAdARSVDys-containing particles separated from the 
helper by virtue of their smaller genome size and therefore different density on CsCi 
gradients. 



wo 00/42208 



PCT/EPOO/00265 



-96- 

To generate pseudotyped particles containing tine pAdARSVDys genome, the 
vector is grown in either 633 or 705 cells and AdS.PgaLAF is used as a helper virus. 
As in the published method, both the Ad5.pgal.AF and AdARSVDys genomes 
replicate and are packaged into particles. The AdS.pgal^F helper provides all the 
essential Ad proteins except fiber, and the fiber protein Is that produced by the cells 
(Ad5 fiber in 633 cells and Ad37 fiber in the case of 705 cells). The particles 
containing AdARSVDys genomes are then isolated by centrifugation. 

Tropism is evaluated by infecting Chang C cells (which express the Ad37 
receptor) and A549 cells which do not express this protein but do express the natural 
Ad5 receptor (CAR). The extent of infection is assessed by immunofluorescence 
staining of the infected cells with an anti-dystrophin antibody. By using purified 
recombinant AdS and Ad37 fiber proteins as competitors during infection, the usage 
of the expected receptors by the pseudotyped particles is evaluated. 



Example 15 
Targeting EBV-lnfected B Cells 

There are a number of cell types, such as EBV-transformed B-lymphocytes, 
that are involved in human disease which are not transducible using standard Ad 
vectors. To address this problem 'pseudotyped' AdS.pgal.AF particles containing 
either the wlldtype AdS fiber protein or a chimeric fiber with the receptor-binding 
knob domain of the adenovirus type 3 (Ad3) fiber were generated. (Von Seggem et 
aL, J. ViroL January, 2000). The strategy used for targeting the B-cells should be 
broadly applicable for targeting gene delivery to other specific cell types. 

Cells and Viruses, THP-1, MRC-5, FaDu, and A-10 cells were purchased 
from the ATCC. 21 IB is a 293-derived cell line that expresses the wild-type AdS 
fiber protein (Von Seggem et al., J, Qen. Virol. 79:1461-1468 (1998)), El- 2a 
(Gorziglla et al., J, ViroL 70:4173-4178 (1996)) is an A549-derived cell line which 
complements adenoviral El and E2a functions. The JR, TO, and TL LCL lines were 
established as described (Huang et aL, Proa NatL Acad. ScL 94.-8156-8161 (1997) 
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by EBV infection of lymphocytes from three normal donors. THP-1 and all LCL lines 
were maintained in RPMI 1640 medium (Gibco) + 10% fetal calf serum (PCS) 
(Hyclone). 211B, MRC-5, and A-10 cells were grown in DMEM + 10% PCS. E1-2a 
and its derivatives were grown in Richter's modified medium (BioWhitaker) + 10% 
PCS. Peripheral blood mononuclear cells were isolated from normal human blood 
(General Clinical Research Center, Scripps Clinic) by sedimentation on Picoll-Paque 
(Pharmacia) per the manufacturer's instructions. Wild type Ad2 and Ad3 were 
purchased from the ATCC. Construction of Ad5.pgal.wt and AdS.pgal.AP (Von 
Seggern et al., J, ViroL 73:1601-1608 (1999)) has been previously described. 
Av1LacZ4 (Mitttereder et al., J. ViroL 70:7498-7509 (1996)) is a first-generation AdS 
vector containing an RSV-driven p-galactosidase reporter gene. Av9LacZ4 
(Stevenson et aL, J. ViroL 7?:4782-4790 (1997)) is identical to Av1LacZ4 except that 
the fiber gene In the vector chromosome was replaced by a recombinant gene 
encoding a chimeric fiber protein with the receptor-binding domain of the Ad3 fiber 
(Stevenson et aL, J, viroL 69:2850-2857 (1995)). Accession numbers for the above 
are as follows. THP-1: TIB-202. MRC-5: CCL-171, FaDu: HTB-43. A-10: CRL-1476. 
Ad2: VR-846, Ad3: VR-3. 

DMA constructs. The complete Ad5 tripartite leader contained in pDV67 and 
pDV69 was constructed by assembly of PCR fragments. pDV55 was constructed 
similar to Example 5. This plasmid contains a 1,2 kb Bam Hl/Bg/ II fragment 
consisting of the first TPL exon, the natural first intron, and the fused second and 
third TPL exons. Finally. pDV60 was constructed by inserting this TPL cassette into 
the Bam HI site upstream of the Ad5 fiber gene in pcDNAS/Piber (Von Seggem et 
aL, J. Gen. ViroL 79:1461-1468 (1998)). pDV61 and pDV67 were then constructed 
similar to example 6. 

The chimeric Ad3/Ad5 fiber gene was amplified from pGEM5T3H (Stevenson 
et aL, J.ViroL 69:2850-2857 (1995) using the primers 5' ATG GGA TOG AAG ATG 
AAG CGC GCA AGA CCG 3'(SEQ ID NO: 75) and 5' CAC TAT AGO GGC CGC ATT 
CTC AGT CAT CTT 3* (SEQ ID NO:76) , and cloned to the Bam HI and Not I sites of 
pcDNA3.1/Zeo(+) via novel Bam HI and Not I sites (bold) engineered into the 
primers to create pDV68. Finally, the complete TPL fragment described above was 
then added to the unique Bam HI site of this plasmid to create pDV69. 

Construction of Stable Cell Lines, El -2a cells were electroporated as 
previously described (Von Seggern et aL, J, Gen. ViroL 75:1461-1468 (1998)) with 
pDV61, pDV67. or pDV69, and stable lines were selected with 600 \}g/m\ Zeocin 
(Invitrogen). Candidate clones were evaluated by immunofluorescence (Von 
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Seggern et al., J. Gen. Virol. 79:1461-1468 (1998)) using a polyclonal antibody 
generated against the Ad2 fiber (Wickham et aL, Cell 73,-309-3^9 (1993). Those 
lines expressing the highest level of nuclear fiber expression were further 
characterized. Line 601 and 633 were produced by transfection of pDV61 and 
pDV67, respectively, and therefore express the wildtype Ad5 fiber. Line 644 
contains pDV69 and expresses the chimeric 5T3H fiber. 

Virus Growth and Analysis. Adenovirus stocks were prepared in the indicated 
cell lines, and plaque-titered on 633 cells essentially as described (Von Seggem et 
aL, J. Virol. 73:1601-1608 (1999)). E1-2a cells (Gorziglia et al., J. Virol. 70:4173- 
4178 (1996). and their derivatives contain a dexamethasone-inducible construct for 
complementation of Ela. 601, 633, or 644 cells were therefore treated with 0.3 mM 
dexamethasone for 24 hours prior to infection, and 0.5 pM dexamethasone was 
included in the overlay for plaque assays. Protein concentration of viral 
preparations was determined using the BioRad Protein Assay (BioRad) with purified 
bovine serum albumin as a standard. Particle number was calculated using the 
formula 1 \ig protein = 4 x 10^ viral particles. Western blotting was performed as 
described (Von Seggern et aL, J. Gen. ViroL 79:1461-1468 (1998)) using polyclonal 
rabbit antibodies raised against either the Ad2 (Wickham et aL, Cell 73:309-319 
(1993) or Ad3 fibers (Stevenson etaL, J. ViroL 77:4782-4790 (1997). 

Determination of infection and binding to receptor was performed using 
methods known to those of skill in the art. 2x10^ cells in a total volume of 200 pi 
were incubated with the indicated Ad preparation for three hours at 37 °C. Cells 
were then washed twice with fresh medium, and returned to 37 °C, Two days later, 
cells were fixed and stained with X-gal and counted by light microscopy as described 
(Von Seggern et aL, J. ViroL 73:1601-1608 (1999)). For competition assays, cells 
were pre-incubated on ice for one hour with either recombinant Ad3 fiber (10 pg/ml) 
purified from baculovirus or with a crude baculovirus lysate (100 pg/ml) containing 
the recombinant Ad2 fiber protein (Wickham et aL, Cell 73:309-319 (1997)). 
Expression of Ov integrins on cell surfaces was assayed by FACS assay using 
monoclonal antibodies (the gift of David Cheresh, TSRI) against either Ovpa (LM609) 
or Ovps (P1F6) as previously described (Huang et aL, Proa. Natl. Acad. ScL USA 
54:8156-8161 (1997)). For virus binding assays, CsCI-purified Ad2 or Ad3 was 
labeled with ^^^1 using lodogen tubes (Pierce), Free iodine was removed by filtration 
with a PD-10 Sephadex column (Pharmacia). Cells (1 x 10® cells in a volume of 200 
[i\ either with or without a 100-fold excess of unlabeled virus) were rocked at 4 "^C for 
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two hours with 1x10^ cpm of the labeled virus, washed three times with PBS and 
counted. 

Altered in vitro tropism and infection of B lymphoid cell lines. 

Experiments with genetically modified viruses showed that a number of 
different cell types are more readily infected through interaction with the Ad3 
receptor than by the CAR-dependent pathway used by Ad5 (Stevenson et aL, J. 
Virol, 77:4782-4790 (1997)). In order to further evaluate the pseudotyping system, 
the ability of AdS.pgal.AF carrying either the Ad5 or chimeric 5T3H fibers to infect 
several cell lines was assayed: FaDu (a head and neck tumor line), THP-1 
monocytic cells, and MRC-5 fibroblasts were assayed. Consistent with the previous 
studies (Stevenson et al., J. Virol, 77:4782-4790 (1997)), use of the chimeric 
Ad5/Ad3 fiber protein increased infection of all of these lines at equal particle/cell 
ratios. In contrast, the rat smooth muscle cell line A-10 was infected somewhat more 
readily by Ad5- than by AdS-pseudotyped particles. 

Gene delivery to EBV-infected B cells could allow the development of 
therapies for a variety of lymphoproliferative disorders. For example, ex vivo purging 
of donor marrow to eliminate infected cells could reduce the risk of EBV-associated 
lymphoproliferative disease, and EBV-induced malignancies such as AIDS- 
associated lymphoma are also potential targets. However, neither B cells nor EBV- 
transformed lymphoblastoid cell lines (LCLs) are efficiently infected by Ad5-based 
vectors. As the tropism of AdS-pseudotyped particles appeared to be somewhat 
broader, it was asked whether EBV-infected LCLs could be infected using this 
system. The ability of AdS-pseudotyped particles to infect LCLs generated by EBV 
infection of lymphocytes from three different normal human donors was tested. In 
agreement with previous reports, there was little or no infection of these by particles 
carrying the Ad5 fiber. In contrast, virus particles equipped with the chimeric fiber 
protein were able to efficiently infect all of these lines. At equal particle/cell ratios, 
all LCLs examined were at least 10-fold more infectible using the AdS receptor. 

Further studies were performed to correlate the efficiency of infection with the 
level of attachment and Internalization receptors expressed by the cells. The three 
LCL lines tested ail bound very low levels of radiolabeled Ad2 particles, indicating 
that they expressed little or no CAR, In contrast, all three were able to specifically 
bind labeled AdS particles. This result suggested that fiber receptor distribution was 
largely responsible for the increased infection of these cells by AdS-pseudotyped 
particles. Selective gene delivery to EBV-infected cells. The results above 
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suggested that the minority of EBV-infected B cells present in donor marrow or 
peripheral blood would be preferentially infected by vectors using the Ad3 receptor. 
To test this hypothesis, a mixing experiment with normal uninfected peripheral blood 
mononuclear cells (PBMCs) and EBV-infected cells was performed. JR-LCL cells 
were mixed at varying ratios with PBMCs isolated from a normal human donor, and 
the mixture was then infected with AdS.pgaLAF particles containing the 5T3H fiber 
protein. No infection of normal PBMCs alone was detected. Moreover, the percent 
of total cells infected increased with the fraction of JR cells added. These 
experiments indicate that EBV-infected cells can be selectively infected in vitro by 
relatively short (3 hours) exposure to a retargeted Ad vector. 



Production of Adenovirus Vectors by Addition of Exogenous Fiber 

The production of fiberless viruses by growth in a complementing cell line 
may result In a preparation that also contains contaminating fiber genome resulting 
from recombination in the complementing cell lines. This disadvantage is eliminated 
by addition of exogenous fiber to a fiberless adenovirus vector. 

Production of fiberless virus by standard methods may include a two-step 
preparation protocol. This has been described in the earlier examples and is briefly 
described here again as follows: 

Step I - amplification of fiber containing fiberless virus (AdS/FyF" or 
AdS.pgaLAF - fiberless, but there is fiber on the surface, not encoded in genome) 
on 21 IB cell line (which stably expresses fiber), followed by CsCI-purification and 
characterization. 

Step II - preparation of virus particles lacking fiber (AdSF*) by infection of 8.8 
cell line with AdS/FVF*, followed by CsCI purification and characterization. This 
produces a large stock of particles which do not contain fiber. 

Step 1 is necessary because the infection efficiency of fiberless virus is 
extremely low, e.g. the dose of 20,000 particles/cell of Ad5/{3g F" gives only 10% 
infected cells. 

Contrary to the above, the production of fiberless virus by addition of 
exogenous fiber involves only a one-step protocol. The fiberless virus is amplified 
using the S.8 cell line with addition of exogenous fiber into infection media. The 
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amount of exogenous fiber necessary for production is very low, no more than 75ng 
of purified fiber required per roller bottle. If desired Xhe process may be followed by 
CsCI purification. As mentioned above, one advantage to this protocol is that it 
should provide no chance for recombination of adenovector during preparation. 

A 10 roller bottle (RB) preparation of fiberiess virus was made using the 
above two-step procedure. The yield of adenovector was 6.6x10^^ particles - total 
Ad/pgalF". A 1 RB preparation of fiberiess adenovector was also made from the 
same initial material using a one-step procedure with exogenous fiber. The total 
yield was 2.5x10^^ particles - Ad5PgalF'(one step procedure). 

DNA was isolated from both preparations and a PGR assay for fiber 
contamination was performed. (Figure 28). The PGR assay was developed for 
detection of very low amounts of fiber contamination, as low as lO'^^g. PGR assay 
showed much lower contamination for the preparation which was done by adding 
exogenous fiber (10*^^g one-step procedure) vs. 10'® two-step procedure). 
Therefore, less contamination was obtained by simpler one-step approach. 

Experiments were done using soluble purified fiber which does not have His- 
taq on the end (AdSFiber = 5F) and with His-taq on the end (AdSFiber His = 5FHis). 
These experiments showed that addition of AdSFiber can dramatically increase 
transduction efficiency of fiberiess adenovector by simply adding it exogenously to a 
fiberiess vector. The presence of the His tag on the AdSFiberHis doesn't have any 
effect. 

The results of these experiments suggest that the fiber is self-assembling with 
the fiberiess vector. This self-assembled virus can then infect cell through the 
normal entry pathway. (Figure 29) Also, an experiment was done using conditioned 
media from 633 cell line, which can stably express fiber. A Western blot analysis for 
633 condition media, showed that soluble fiber was present in the media during the 
period of cultivation of this cell line. Presence of soluble fiber in the media gives the 
possibility to increase transduction efficiency of fiberiess adenovector on the HDF 
cell line. (Figure 30) Because the HDF cell line doesn't have a GAR-receptor, it is 
especially difficult to transduce this particular cell line, not only with fiberiess vector, 
but also with regular fiber containing adenovector. Different amounts of 633 
conditioned media (250pl, SOOpI or lOOOpI) were added to infectious media during 
the incubation period with fiberiess adenovector. 
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This experiment also showed a role of soluble fiber in the process of cell 
entry. The conclusion is that by adding any fiber (wild-type, mutated, with ligand 
fusions) as long as on has the wild-type shaft (or region necessary to bind penton) 
one can retarget fiberless vector with any genome inside (gutless, oncolytic, 
expressing any transgene, etc.) to any cell type that your fiber is specific to. The 
advantage of this approach is that one does not have to make vectors with each new 
ligand. Just one fiberless vector need be made that can then be used to make 
different backbones by adding an exogenous "targetable" fiber off the shelf. 

The foregoing specification, including the specific embodiments and 
examples, is intended to be illustrative of the present invention and is not to be taken 
as limiting. Numerous other variations and modifications can be effected without 
departing from the true spirit and scope of the present invention. All publications, 
patents and patent applications cited herein are incorporated by reference in their 
entirety into the present disclosure. 
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What is claimed is: 

1. An isolated nucleic acid molecule comprising an adenovirus tripartite leader 
(TPL) nucleotide, said TPL nucleotide sequence comprising (a) first and second 
different TPL exons or (b) first, second and third same or different TPL exons, said 
TPL exons selected from the group consisting of complete TPL exon 1 , partial TPL 
exon 1, complete TPL exon 2 and complete TPL exon 3, wherein said isolated 
nucleic acid molecule may not comprise the nucleic acid sequence consisting of 
partial TPL exon 1 , complete TPL exon 2 and complete TPL exon 3. 

2. The isolated nucleic acid molecule of claim 1, wherein said sequence is 
operatively linked to an intron containing an RNA processing signal. 

3. The isolated nucleic acid molecule of claim 1 or 2 wherein said TPL 
nucleotide sequence consists essentially of complete TPL exon 1 operatively linked 
to complete TPL exon 2 operatively linked to complete TPL exon 3. 

4. The isolated nucleic acid molecule of claim 2 wherein said intron is native 
adenovirus intron 1. 

5. The isolated nucleic acid molecule of any one of claims 1 to 4 wherein said 
TPL nucleotide sequence is shown in SEQ ID NO: 32. 

6. The isolated nucleic acid molecule of claim 5 further comprising a promoter 
and a nucleic acid sequence which encodes an adenoviral structural protein, 
operatively linked to said promoter and said TPL sequence. 

7. The isolated nucleic acid molecule of claim 6 wherein said adenoviral 
structural protein is a fiber protein or a chimeric protein which includes an 
adenovirus fiber protein tail domain. 

8. The isolated nucleic acid molecule of claim 7 wherein said chimeric protein 
comprises an Ad3 head domain and an Ad5 tail domain or an Ad5 head domain and 
an Ad3 tail domain. 
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9. The isolated nucleic acid molecule of claim 7 wherein said molecule is 
contained in a plasmid selected from the group consisting of plasmids pCLF, pDVSO, 
pDV67, pDV69, pDVSO and pDV90. 

10. The isolated nucleic acid molecule of claim 9 wherein said molecule has a 
nucleotide sequence selected from the group consisting of sequences shown in SEQ 
ID NO: 8, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 47, SEQ ID NO: 64 and 
SEQ ID NO:65. 

11. An adenovirus vector complementing plasmid comprising an isolated nucleic 
acid molecule according to any one of claims 1 to 10. 

12. An adenovirus vector packaging cell line comprising a stably integrated 
nucleic acid molecule as claimed in any one of claims 1 to 11, an operatively-linked 
promoter and a nucleic acid sequence which encodes an adenovirus structural 
protein, wherein said TPL sequence consists essentially of a first complete TPL exon 
operatively linked to a complete second TPL exon operatively linked to a complete 
third TPL exon. 

13. The cell line of claim 12 wherein said first TPL exon Is a complete or partial 
first TPL exon. 

14. The cell line of claim 13 wherein said TPL molecule comprises complete TPL 
exon 1 having the nucleotide sequence of SEQ ID NO: 32 or partial TPL exon 1 
having the nucleotide of SEQ ID NO: 26. 

15. The cell line of claim 12 wherein said promoter is an inducible promoter. 

16. The cell line of claim 12 wherein said adenovirus structural protein is 
adenovirus fiber protein or a chimeric protein which includes an adenovirus fiber 
protein tail domain. 

17. The cell line of claim 12 wherein said chimeric protein comprises an Ad3 
head domain and an Ad5 tail domain or an Ad5 head domain and an Ad3 tail 
domain. 
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18. The cell line of claim 12 wherein said nucleic acid molecule is selected from 
the group consisting of plasmids pDV60, pDV67, pDV69, pDV80 and pDV90. 

19. The cell line of claim 18 wherein said nucleic acid molecule has a nucleotide 
sequence from the group consisting of sequences shown in SEQ ID NO: 43, SEQ ID 
NO: 44 and SEQ ID NO: 47. 

20. The cell line of claim 12 wherein said cell line is an epithelial cell line. 

21. The cell line of claim 20 wherein said cell line supports the production of a 
recombinant adenovirus vector genome by complementation of a deficient viral gene 
in said vector genome. 

22. The cell line of claim 21 wherein said cell line further produces an adenovirus 
protein and thereby complements a deficient adenovirus gene in said vector 
genome, and wherein said cell line complements an adenovirus early protein gene 
and a fiber gene. 

23. The cell line of claim 22 wherein the deletion of said deficient adenovirus 
gene is complemented by the expression of said gene under the control of an 
inducible promoter. 

24. A recombinant adenovirus particle comprising a recombinant adenovirus 
vector genome wherein said genome does not encode or does not express sufficient 
adenovirus fiber protein to support packaging of a fiber-containing adenovirus 
particle without complementation of said fiber gene in a packaging cell, wherein said 
packaging cell comprises an isolated nucleic acid molecule comprising an 
adenovirus tripartite leader (TPL) nucleotide sequence, and optionally an exogenous 
protein, said TPL nucleotide sequence comprising (a) first and second different TPL 
exons or (b) first, second and third same or different TPL exons, said TPL exons 
selected from the group consisting of complete TPL exon 1, partial TPL exon 1, 
complete TPL exon 2 and complete TPL exon 3 and, wherein said isolated nucleic 
acid molecule may not comprise the nucleic acid sequence consisting of partial TPL 
exon 1 , complete TPL exon 2 and complete TPL exon 3. 
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25. The recombinant adenovirus particle of claim 24 wherein said adenovirus 
vector genome does not encode one or more functional proteins selected from the 
group consisting of E1A, E1B, E2A, E2B, E3 and E4 protein. 

26. The particle of claim 24 wherein said adenovirus vector genome is 
AdS.BgaLAF. 

27. The particle of claim 24 wherein said adenovirus vector genome is contained 
in the adenovirus particle deposited under ATCC accession # VR2636 and 
corresponding to AdS.BgaLAF. 

28. The particle of claim 24 wherein said particle lacks fiber protein or contains a 
modified fiber protein. 

29. The particle of claim 24 wherein said particle comprises an adenovirus fiber 
protein or a chimeric protein having an adenovirus fiber protein tail domain, said 
chimeric protein comprising an Ad3 head domain and an Ad5 tail domain or an Ad5 
head domain and an AdS tail domain. 

30. The particle of claim 24 wherein said exogenous protein is a therapeutic gene 
product. 

31. A helper-independent fiberless recombinant adenovirus vector genome 
comprising genes which: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber-containing adenovirus particle 
in a packaging cell without complementation of said fiber gene or said genome lacks 
at least the fibre gene, wherein said packaging cell comprises an isolated nucleic 
acid molecule comprising an adenovirus tripartite leader (TPL) nucleotide sequence, 
said TPL nucleotide sequence comprising (a) first and second different TPL exons or 
(b) first, second and third same or different TPL exons, said TPL exons selected 
from the group consisting of complete TPL exon 1, partial TPL exon 1, complete TPL 
exon 2 and complete TPL exon 3 and, wherein said isolated nucleic acid molecule 
may not comprise the nucleic acid sequence consisting of partial TPL exon 1, 
complete TPL exon 2 and complete TPL exon 3. and 

(b) encode an exogenous protein. 
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32. The adenovirus vector genome of claim 31 wherein said adenovirus vector 
genome does not encode one or more functional proteins selected from the group 
consisting of E1A, E1B, E2A, E2B, E3 and E4 protein. 

33. The adenovirus vector genome of claim 31 wherein said adenovirus vector 
genome is AdS.Bgal. AF. 

34. The adenovirus vector genome of claim 33 wherein said adenovirus vector 
genome has a nucleotide sequence shown in SEQ ID NO;27 and corresponds to 
AdS.Bgal. AF. 

35. The adenovirus vector genome of claim 31 wherein said adenovirus vector 
genome is contained in the adenovirus particle deposited under ATCC accession 
VR-2636 corresponding to AdS.Bgal. AF. 

36. The adenovirus vector genome of claim 31 wherein said exogenous protein is 
a therapeutic gene product. 

37. An isolated nucleic acid that comprises the adenovirus vector genome of 
claim 31. 

38. A method for producing an adenovirus vector particle containing a helper- 
independent fiberless recombinant adenovirus vector genome, said method 
comprising providing a packaging cell line which complements replication and 
packaging of said genome and a helper-independent fiberless recombinant 
adenovirus vector genome which is deficient in expressing sufficient functional fiber 
protein to support assembly of fiber-containing particles, wherein said packaging cell 
comprises an isolated nucleic acid molecule comprising an adenovirus tripartite 
leader (TPL) nucleotide, said TPL nucleotide sequence comprising (a) first and 
second different TPL exons or (b) first, second and third different TPL exons, said 
TPL exons selected from the group consisting of complete TPL exon 1 , partial TPL 
exon 1 , complete TPL exon 2 and complete TPL exon 3 and, wherein said isolated 
nucleic acid molecule may not comprise the nucleic acid sequence consisting of 
partial TPL exon 1 , complete TPL exon 2 and complete TPL exon 3, and harvesting 
said particles produced by said cell line. 
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39. The method of claim 38 wherein said packaging cell line complements 
adenovirus fiber protein. 

40. The method of claim 38 wherein said adenovirus vector genome comprises 
genes that: 

(a) express all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber-containing adenovirus particle 
without complementation of said fiber gene or said genome lacks at least the fibre 
gene, and 

(b) express an exogenous protein. 

41 . The method of claim 38 wherein said packaging cell line comprises a stably 
integrated first nucleic acid molecule alternatively operatively linked to a promoter, 
and said first nucleic acid is operatively linked to a second nucleic acid molecule 
encoding an adenovirus structural protein, wherein said first nucleic acid molecule 
comprises an adenovirus tripartite leader (TPL) nucleotide sequence operatively 
linked to an intron containing an RNA processing signal, said TPL nucleotide 
sequence comprising (a) first and second different TPL exons or (b) first, second and 
third different TPL exons, said TPL exons selected from the group consisting of 
complete TPL exon 1, partial TPL exon 1, complete TPL exon 2 and complete TPL 
exon 3 and wherein said isolated nucleic acid molecule may not comprise the 
nucleic acid sequence consisting of partial TPL exon 1 , complete TPL exon 2 and 
complete TPL exon 3.. 

42. The method of claim 38 wherein said helper-independent fiberless 
recombinant adenovirus vector genome is introduced by infecting said cell line with a 
virus particle containing said genome. 

43. The method of claim 42 wherein said particle is a particle comprising a 
helper-independent recombinant adenovirus vector genome comprising genes that: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to support packaging of a fiber-containing 
adenovirus particle without complementation of said fiber gene or said genome lacks 
at least the fibre gene, and 

(b) encode an exogenous protein, 
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wherein said particle comprises an adenovirus fiber protein or a chimeric 
protein that includes an adenovirus fiber protein tail domain. 

44. The method of claim 38 wherein said helper-independent fiberless 
recombinant adenovirus vector genome is introduced into said cell line by 
transfecting said cell line with said helper-independent fiberless recombinant 
adenovirus vector genome 

45. The method of claim 44 wherein said adenovirus vector genome comprises 
genes which: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber-containing adenovirus particle 
without complementation of said fiber gene or said genome lacks at least the fibre 
gene, and 

(b) encode an exogenous protein. 

46. The method of claim 38 wherein said packaging cell line is transfected with a 
nucleic acid molecule encoding adenovirus fiber protein. 

47. The method of claim 46 wherein said nucleic acid molecule is a nucleic acid 
molecule comprising an adenovirus tripartite leader (TPL) nucleotide sequence, said 
TPL nucleotide sequence comprising (a) first and second different TPL exons or (b) 
first, second and third different TPL exons, said TPL exons selected from the group 
consisting of complete TPL exon 1, partial TPL exon 1, complete TPL exon 2 and 
complete TPL exon 3 and said molecule further comprises a sequence encoding 
adenovirus fiber protein. 

48. The method of claim 39 wherein said adenovirus fiber protein is a modified 
fiber protein. 

49. The method of claim 38 further comprising the step of coating said particle 
with adenovirus fiber protein. 

50. A method for delivery of an exogenous gene to a target cell comprising 
contacting said cell with an amount of a recombinant adenovirus particle of claim 24 
sufficient to infect said cell. 
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Si. The method of claim 50 wherein said exogenous gene encodes a therapeutic 
gene product. 

52. The method of claim 51 wherein said recombinant adenovirus particle 
contains a modified fiber protein which binds a preselected target cell and directs 
delivery of the particle to said target cell. 

53. The method of claim 50 wherein said recombinant adenovirus particle 
comprises a helper-independent fiberless recombinant adenovirus vector genome 
comprising genes that: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to package a fiber-containing adenovirus particle 
without complementation of said fiber gene or said genome lacks at least the fibre 
gene, and 

(b) encode an exogenous protein. 

54. The method of claim 50 wherein said modified fiber protein has an amino 
terminal head domain which binds to av integrins and thereby targets cells with ay 
integrin receptors. 

55. The method of claim 50 wherein said contacting is conducted in vitro. 

56. The method of claim 55 wherein said contacting is conducted on cells of a 
tissue which are first removed from the body of a patient, and the cells are 
subsequently returned to said patient. 

57. The method of claim 50 wherein said contacting is conducted in vivo by 
administering said recombinant adenovirus particle to a tissue of said patient. 

58. The method of claim 57 wherein said administering is intravenously, 
intraperitoneally, by aerosol, topically or by injection. 

59. A method for pseudotyping recombinant viral vectors comprising 
complementing a missing fiber gene of a helper-independent or helper dependent 
fibertess recombinant adenovirus vector genome by expressing in packaging cells a 
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fiber gene from a different adenoviral serotype than said recombinant adenovirus 
vector, thereby pseudotyping said vector. 

60. A method for specifically targeting an adenovirus vector to a cell of choice 
comprising introducing a helper-independent or helper-dependent fiberless 
recombinant adenovirus vector genome into a packaging cell line for producing a 
fiber gene-deleted adenovirus vector, wherein said gene for a missing fiber protein is 
complemented with a gene for a desired modification for targeting the vector to a cell 
of choice. 

61. A method for producing a modified adenovirus comprising providing in vitro an 
exogenous fiber protein to a fiberless adenovirus. 

62. The method of claim 61, wherein said fiber is provided by adding fiber protein 
in a suitable buffer to a fiberless virus preparation, thereby producing a modified 
adenovirus. 

63. The method of claim 61, wherein a helper-independent or helper-dependent 
fiberless recombinant adenovirus vector genome is introduced into a packaging cell 
line to produce a fiberless adenovirus to which exogenous fiber protein will be 
provided. 

64. A method for delivering a heterologous gene to EBV-infected B cells 
comprising infecting said B cells with a pseudotyped AdSpgal.AF particle or other 
fiber-deleted adenovirus particle, said particle having a chimeric fiber including the 
receptor-binding knob domain of the adenovirus type 3 fiber. 

65. The adenovirus particle of claim 24 wherein said adenovirus vector genome 
lacks a fibre gene or lacks a portion of the fibre gene sequence such that fibre 
protein is not expressed in sufficient quantities to support packaging. 

66. The recombinant adenovirus particle of claim 24 comprising a helper- 
independent recombinant adenovirus vector genome comprising genes that: 

(a) encode all adenovirus structural gene products but do not express 
sufficient adenovirus fiber protein to support packaging of a fiber-containing 
adenovirus particle without complementation of said fiber gene or said genome lacks 
at least the gene encoding fibre, and 
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(b) optionally encodes an exogenous protein. 

67. The recombinant adenovirus particle of claim 24 wherein said particle further 
comprises a nucleic acid encoding an exogenous protein. 

68. A method for producing a modified adenovirus comprising providing a 
packaging cell line for producing a fiberless adenovirus helper-dependent fiberless 
recombinant adenovirus vector genome and a helper virus vector, wherein said cell 
line complements at least a deficient fiber protein gene, thereby producing the 
modified adenovirus. 

69. The packaging cell line of claim 9 wherein said cell line is selected from the 
group consisting of 293, A549, W163, HeLa, Vero, 211, 211 A and an epithelial cell 
line comprising the stably integrated nucleic acid molecule. 

70. The recombinant adenovirus particle of claim 21 wherein said exogenous 
protein is selected from a group consisting of a tumor-suppressor protein, a 
biologically active fragment thereof, a suicide protein and a biologically active 
fragment thereof. 

71 . A composition for preparing a therapeutic vector, said composition comprising 
a plasmid comprising an adenovirus genome lacking a nucleotide sequence 
encoding a fiber protein or a genome that is incapable of expressing sufficient fiber 
to result in packaging. 

72. A method of delivering a heterologous gene to a human or any animal 
comprising providing said heterologous gene to a target cell wherein said target cell 
is contacted in vivo or ex vivo with an amount of a recombinant adenovirus particle of 
claim 24 sufficient to infect said cell and thereby deliver the heterologous gene. 

73. The recombinant adenovirus vector particle of either claim 24 or 31, wherein 
no fiber protein is expressed. 
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74. The recombinant adenovims vector particle of claim 24, wherein said genome 
expresses insufficient fiber to allow incorporation of said protein into the particle 
such that the particle cannot use the fiber pathway for infection. 

75. The recombinant adenovirus genome of claim 31. wherein said genome 
expresses insufficient fiber to allow incorporation of said protein into a particle such 
that the particle cannot use the fiber pathway for infection. 

76. A method for producing a gutless adenoviral vector particle comprising: 

a) delivering a helper adenovirus vector genome to an adenovirus vector 
packaging cell, wherein said helper adenovirus vector genome lacks any gene 
encoding adenovirus fiber protein or lacks the ability to encode sufficient adenovirus 
fiber protein to produce an adenoviral vector comprising fiber protein in the absence 
of complemetation by said packing cell and wherein said packaging cell comprises 
the nucleic acid molecule of claim 2 operably linked to a promoter and to an 
adenoviral fiber protein or to a chimeric protein that includes an adenovirus fiber 
protein tail domain; 

(b) delivering a gutless adenovirus vector genome to said packaging cell; and 

(c) recovering the gutless adenoviral vector particle produced by said cell. 

77. The method of claim 76, wherein said helper adenovirus vector genome is 
delivered by viral infection. 

78. The method of claim 77, wherein said gutless adenovirus vector genome is 
delivered by transfection. 

79. The method of claim 76, wherein said gutless adenovirus vector genome 
comprises an operable packaging sequence. 

80. The method of claim 79, wherein said helper adenovirus vector genome has a 
mutation in its packaging sequence that renders said genome substantially 
incapable of being packaged as an adenoviral vector particle by said packaging cell. 

81. The method of claim 79, wherein said helper adenovirus vector genome 
comprises recombinase sites flanking its packaging sequence and said packaging 
cell further comprises a nucleotide sequence encoding a recombinase. 
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82. The method of claim 81, wherein said recombinase site is a lox site and said 
recombinase is Cre. 

83. A helper adenovirus particle comprising an adenovirus vector genome that 
does not encode or does not express sufficient adenovirus fiber protein to support 
packaging of a fiber-containing adenovirus particle without complementation of said 
fiber gene, wherein said genome has a mutation in its packaging sequence that 
renders said genome substantially incapable of being packaged. 

84. The helper adenovirus particle of claim 83, wherein said mutation comprises a 
deletion of at least one nucleotide in said packaging sequence. 

85. The helper adenovirus particle of claim 84, wherein said adenovirus vector 
genome does not encode functional proteins selected from the group consisting of 
E1A, E1B, E2A, E2B, E3, and E4 proteins. 

86. A helper adenovirus particle comprising an adenovirus vector genome with 
recombinase sites flanking its packaging sequence, wherein said vector genome 
does not encode or does not express sufficient adenovirus fiber protein to support 
packaging of a fiber-containing adenovirus particle without complementation of said 
fiber gene. 

87. The helper adenovirus particle of claim 86, wherein said adenovirus vector 
genome does not encode functional proteins selected from the group consisting of 
El A, E1B, E2A, E2B, E3, and E4 proteins, 

88. An adenovirus particle comprising a gutless adenoviral vector genome and a 
fiberless capsid. 

89. An adenovims particle comprising a gutless adenoviral vector genome and a 
capsid comprising a modified fiber protein, 

90. A packaging cell for the production of a fiberiess or fiber-modified gutless 
adenovirus particle comprising an adenovirus vector complementing plasmid and a 
nucleotide sequence encoding a recombinase, wherein said complementing plasmid 
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comprises the nucleic acid molecule of claim 2 operably linked to a promoter and to 
a nucleotide sequence encoding an adenoviral fiber protein or a chimeric adenoviral 
fiber protein. 

91. The packaging cell of claim 90, wherein said complementing plasmid and said 
nucleotide sequence encoding a recombinase are stably integrated into the genome 
of said cell. 

92. The packaging cell of claim 90, further comprising a helper adenovirus vector 
genome. 

93. The packaging cell of claim 90, wherein said recombinase is Cre.. 
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Transduction Efficiency For Fiberless Virus with and without soluble fiber 
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SEQUENCE LISTING 



<llo> Novartis Ag 

The SCRIPPS RESEARCH INSTITUTE 



<120> ADENOVIRUS VECTORS, PACKAGING CELL LINES, COMPOSITIONS, 
AND METHODS FOR PREPARATION AND USE 

<130> 1294.0010001 

<140> 
<141> 

<160> 76 

<170> Patentin Ver, 2.1 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 1 

cggtacacag aattcaggag acacaactcc 30 

<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 3 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 
<400> 3 

cgcggatccg eg 12 

<210> 4 
<211> 8710 
<212> DNA 

<213> Artificial Sequence 



<400> 2 

gcctggatcc gggaagttac gtaacgtggg aaaac 



35 
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<220> , . . 

<223> Description of Artificial Sequence: piasmxa 

caccJaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt gttaaatcag 60 
ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa aagaatagac 120 
cgagataggg ttgagtgtig ttocagtttg gaacaagagt ccactattaa agaacgtgga 180 
c?ccaacg?c aalgggcgaa aaaccgtcta tcagggcgat ggcccactac gtgaaccatc 240 
accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga accctaaagg 300 
gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa aggaagggaa 360 
gaLgcgala ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc tgcgcgtaac 420 
laccacaccc gccgcgctta atgcgccgct acagggcgcg tcccattcgc cattcaggct 480 
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 540 
Igggggatgt gltgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg 600 
tlgtlaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggtaccgg 660 
gccccccctc gaggtcgacg gtatcgataa gcttgatatc gaattcagga gacacaactc 720 
laagtgcata ctctatgtca ttttcatggg actggtctgg ccacaactac attaatgaaa 780 
tatttgccac atcctcttac actttttcat acattgccca agaataaaga atcgtttgtg 840 
ttatg?ttca acgtgtttat ttttcaattg cagaaaattt caagtcattt ttcattcagt 900 
agta?agccc caccaccaca tagcttatac agatcaccgt accttaatca aactcacaga 960 
accctagtat tcaacctgcc acctccctcc caacacacag agtacacagt ""tctccc 1020 
cggctggcct taaaaagcat catatcatgg gtaacagaca tattcttagg tgttatattc 1080 
cicacigttt cctgtcgagc caaacgctca tcagtgatat taataaactc cccgggcagc 1140 
tcacttaagt tcatgtcget gtccagctgc tgagccacag gctgctgtcc aacttgcggt 1200 
tgcttaacgg gcggcgaagg agaagtccac gcctacatgg gggtagagtc ataatcgtgc 1260 
atcaggatag ggcggtggtg ctgcagcagc gcgcgaataa actgctgccg ccgccgctcc 1320 
gtcc£gcag5 latlcaacat ggcagtggtc tcctcagcga tgattcgcac cgcccgcagc 1380 
ataaggcgcc ttgtcctccg ggcacagcag cgcaccctga tctcacttaa atcagcacag 1440 
taacigcagc aclgcaccac aatattgttc aaaatcccac agtgcaaggc gctgtatcca 1500 
aagctcatgg cggggaccac agaacccacg tggccatcat accacaagcg caggtagatt 1560 
aagtggcgic ccctcataaa cacgctggac ataaacatta cctcttttgg catgttgtaa 1620 
ttcaccacct cccggtacca tataaacctc tgattaaaca tggcgccatc caccaccatc 1680 
ctaaaccagc tggccaaaac ctgcccgccg gctatacact gcagggaacc gggactggaa 1740 
caatgacagt ggagagccca ggactcgtaa ccatggatca tcatgctcgt catgatatca 1800 
atgt?ggclc lacLIggca cacgtgcata cacttcctca ggattacaag ctcctcccgc 1860 
gtLglacca tatcccaggg aacaacccat tcctgaatca gcgtaaatcc cacactgcag 1920 
ggaagacctc gcacgtaact cacgttgtgc attgtcaaag tgttacattc gggcagcagc 1980 
ggatgatcct ccagtatggt agcgcgggtt tctgtctcaa aaggaggtag acgatcccta 2040 
c?gtLggag tgcgccgaga caaccgagat cgtgttggtc gtagtgtcat gccaaatgga 2100 
acgccggacg tlg?ca?att tcctgaagca aaaccaggtg cgggcgtgac aaacagatct 2160 
gcgtclccgg tctcgccgct tagatcgctc tgtgtagtag ttgtagtata tccactctct 2220 
caaagcatcc aggcgccccc tggcttcggg ttctatgtaa actccttcat gcgccgctgc 2280 
cctgltaaca tccaccaccg cagaataagc cacacccagc caacctacac attcgttctg 2340 
cgagtcacac acgggaggag cgggaagagc tggaagaaco atgttttttt ttttattcca 2400 
alagattatc caalLctca aaatgaagat ctattaagtg aacgcgctcc cctccggtgg 2460 
cgtggtcaaa ctctacagcc aaagaacaga taatggcatt tgtaagatgt tgcacaatgg 2520 
cttccaaaag gcaaacggcc ctcacgtcca agtggacgta aaggctaaac ccttcagggt 2580 
gaatctcctc tataaacatt ccagcacctt caaccatgcc caaataattc tcatctcgcc 2640 
accttctcaa tatatctota agcaaatccc gaatattaag tccggccatt gtaaaaatct 2700 
gctccagagc gccctccacc ttcagcctca agcagcgaat catgattgca aaaattcagg 2760 
Stcctcacag acctgtataa gattcaaaag cggaacatta acaaaaatac cgcgatcccg 2820 
taggtccctt cgcagggcca gctgaacata atcgtgcagg tctgcacgga ccagcgoggc 2880 
cac?tccccg ccagglacct tgacaaaaga acccacactg attatgacac gcatactcgg 2940 
agctatgcti accagcgtag ccccgatgta agctttgttg catgggcggc gatataaaat 3000 
gcaaggtgct gctcaaaaaa tcaggcaaag cctcgcgcaa aaaagaaagc acatcgtagt 3060 
catgctcatg cagataaagg caggtaagct coggaaccac cacagaaaaa gacaccattt 3120. 
ttctctcaaa catgtctgcg ggtttctgca taaacacaaa ataaaataac aaaaaaacat 3180 
ttaaacatta gaagcct?tc ttacaacagg aaaaacaacc cttataagca taagacggac 3240 
tacggccatg Icggcgtgac cgtaaaaaaa ctggtcaccg tgattaaaaa goaccaccga 3300 
cagl?cctcg gtclt|tlcg gagtoataat gtaagactcg gtaaacacat caggttgatt 3360 
ca?cggtcag ?gctalaaag cgaccgaaat agcccggggg aatacatacc cgcaggcgta 3420 
gagalLcaf tacagccccc ataggaggta taacaaaatt aataggagag aaaaacacat 3480 



wo 00/42208 



PCT/EPOO/00265 



-3- 



aaacacctga 
acagcgcttc 
aaaaaacacc 
gcagagcgag 
ccagaaaacc 
caaatcgtca 
tctccgcaag 
ccggcttcca 
cagacaaggt 
gcggcataaa 
gcgagcgatc 
tgcaacgcgg 
cagcctcgcg 
ttcatccccg 
catgtcttta 
aacacgcaga 
cgtgtggcct 
ccgcagatcc 
agtttctgat 
aatctcgtgc 
gcgccgatgg 
cgat tccgga 
gccgtgcaca 
agccggtcgc 
tcggcccatt 
cgattgctga 
ccgtcgcgca 
acctcgtgca 
cggtcattga 
tcttctggag 
atccggagct 
aactctatca 
. gcgacgcaat 
gcgcggccgt 
ccagcactcg 
aggagacaat 
acgggtgttg 
gataccccac 
caccccccaa 
gccatagcca 
tcgtgggggt 
cagacccatg 
ccgggcgtct 
cgcccagtgc 
gccctgacgg 
gagctgcatg 
taatcagcca 
cctgaacctg 
taatggttac 
gcattctagt 
gttctagagc 
atttcgagct 
acaattccac 
gtgagctaac 
tcgtgccagc 
cgctcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 



aaaaccctcc 
acagcggcag 
actcgacacg 
tatatatagg 
gcacgcgaac 
cttccgtttt 
aattgattgg 
ttcaggtcga 
atagggcggc 
tcgccgtgac 
cttgaagctg 
gcatcccgat 
tcgcgaacgc 
tggcccgttg 
gttctatgat 
tg'cagtcggg 
cgaacaccga 
cgggcaatga 
cgaaaagttc 
tttcagcttc 
tttctacaaa 
agtgcttgac 
gggtgtcacg 
ggaggccatg 
cggaccgcaa 
tccccatgtg 
ggctctcgat 
cgcggatttc 
ctggagcgag 
gccgtggttg 
tgcaggatcg 
gagcttggtt 
cgtccgatcc 
ctggaccgat 
tccgagggca 
accggaagga 
ggtcgtttgt 
cgagacccca 
gttcgggtga 
ctggccccgt 
tattattttg 
gtttttggat 
gtggctgcca 
cgtcgaccgg 
gcttgtctgc 
tgtcagaggt 
taccacattt 
aaacataaaa 
aaataaagca 
tgtggtttgt 
ggccgccacc 
tggcgtaatc 
acaacatacg 
tcacattaat 
tgcattaatg 
cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 



tgcctaggca 
cctaacagtc 
gcaccagctc 
actaaaaaat 
ctacgcccag 
cccacgttac 
ctccaattct 
ggtggcccgg 
gcctacaatc 
gatcagcggt 
tccctgatgg 
gccgccggaa 
cagcaagacg 
ctcgcgtttg 
gacacaaacc 
gcggcgcggt 
gcgaccctgc 
gatatgaaaa 
gacagcgtct 
gatgtaggag 
gatcgttatg 
attggggaat 
ttgcaagacc 
gatgcgatcg 
ggaatcggtc 
tatcactggc 
gagctgatgc 
ggctccaaca 
gcgatgttcg 
gcttgtatgg 
ccgcggctcc 
gacggcaatt 
ggagccggga 
ggctgtgtag 
aaggaatagg 
acccgcgcta 
tcataaacgc 
ttggggccaa 
aggcccaggg 
gggttaggga 
ggcgttgcgt 
ggcctgggca 
aacacccccg 
tcatggctgc 
tcccggcatc 
tttcaccgtc 
gtagaggttt 
tgaatgcaat 
atagcatcac 
ccaaactcat 
gcggtggagc 
atggtcatag 
agccggaagc 
tgcgttgcgc 
aatcggccaa 
cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
tagctcacgc 



aaatagcacc 
agccttacca 
aatcagtcac 
gacgtaacgg 
aaacgaaagc 
gtaacttccc 
tggagtggtg 
ctccatgcac 
catgccaacc 
ccagtgatcg 
tcgtcatcta 
gcgagaagaa 
tagcccagcg 
ctggcggtgt 
ccgcccagcg 
cccaggtcca 
agcgacccgc 
agcctgaact 
ccgacctgat 
ggcgtggata 
tttatcggca 
tcagcgagag 
tgcctgaaac 
ctgcggccga 
aatacactac 
aaactgtgat 
tttgggccga 
atgtcctgac 
gggattccca 
agcagcagac 
gggcgtatat 
tcgatgatgc 
ctgtcgggcg 
aagtactcgc 
ggagatgggg 
tgacggcaat 
ggggttcggt 
tacgcccgcg 
ctcgcagcca 
cggggtcccc 
ggggtctggt 
tggaccgcat 
acccccaaaa 
gccccgacac 
cgcttacaga 
atcaccgaaa 
tacttgcttt 
tgttgttgtt 
aaatttcaca 
caatgtatct 
tccagctttt 
ctgtttcctg 
ataaagtgta 
tcactgcccg 
cgcgcgggga 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 



ctcccgctcc 
gtaaaaaaga 
agtgtaaaaa 
ttaaagtcca 
caaaaaaccc 
ggatccgcgg 
aatccgttag 
cgcgacgcaa 
cgttccatgt 
aagttaggct 
cctgcctgga 
tcataatggg 
cgtcggccgc 
ccccggaaga 
tcttgtcatt 
cttcgcatat 
ttaacagcgt 
caccgcgacg 
gcagctctcg 
tgtcctgcgg 
ctttgcatcg 
cctgacctat 
cgaactgccc 
tcttagccag 
atggcgtgat 
ggacgacacc 
ggactgcccc 
ggacaatggc 
atacgaggtc 
gcgctacttc 
gctccgcatt 
agcttgggcg 
tacacaaatc 
cgatagtgga 
gaggctaact 
aaaaagacag 
cccagggctg 
tttcttcctt 
acgtcggggc 
catggggaat 
ccacgactgg 
gtactggcgc 
accaccgcgc 
ccgccaacac 
caagctgtga 
cgcgcgaggc 
aaaaaacctc 
aacttgttta 
aataaagcat 
tatcatgtct 
gttcccttta 
tgtgaaattg 
aagcctgggg 
ctttccagtc 
gaggcggttt 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
tgtccgcctt 
tcagttcggt 



agaacaacat 
aaacctatta 
agggccaagt 
caaaaaacac 
acaacttcct 
cattcacagt 
cgaggtgccg 
cgcggggagg 
gctcgccgag 
ggtaagagcc 
cagcatggcc 
gaaggccatc 
catgccctgc 
aatatatttg 
ggcgaattcg 
taaggtgacg 
caacagcgtg 
tctgtcgaga 
gagggcgaag 
gtaaatagct 
gccgcgctcc 
tgcatctccc 
gctgttctgc 
acgagcgggt 
ttcatatgcg 
gtcagtgcgt 
gaagtccggc 
cgcataacag 
gccaacatct 
gagcggaggc 
ggtcttgacc 
cagggtcgat 
gcccgcagaa 
aaccgacgcc 
gaaacacgga 
aataaaacgc 
gcactctgtc 
ttccccaccc 
ggcaggccct 
ggtttatggt 
actgagcaga 
gacacgaaca 
ggatttctgg 
ccgctgacgc 
ccgtctccgg 
agccggatca 
cccacctccc 
ttgcagctta 
ttttttcact 
ggatccacta 
gtgagggtta 
ttatccgctc 
tgcctaatga 
gggaaacctg 
gcgtattggg 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 



3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900' 

6960 

7020 

7080 

7140 

7200 
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cgctccaagc tgggctgtgt .-cgaaccc cccgtca.c ccgacc.ct. ^^^^^^.To nlfo 
ggtaactatc gtcttgagtc caacccggta ^^^^^^^act J cttgaagtgg 7380 

actggtaaca ggattagcag agcgaggtat gtaggcggtg ^ gotgaagcca 74 40 

tgglctaact acggctacac tagaaggaca g^atttggta tctg^g^^^^ Igctggtagc 7500 
gttaccttcg gaaaaagagt ta^tagctct tgatccgg tcaagaagat 7560 

ggtggttttt ttgtttgcaa gcagoagatt ^^^^^^ aaaacicacg ttaagggatt 7620 
cctttgatct tttctacggg f^^^gacgct ^^^gg g ^^^^^^^ aaaatgaagt 7680 
ttggtcatga gattatcaaa "ggatcttc ^"^ag ^ ^^^^^^ atgcttaato 7740 

tttaaatcaa tctaaagtat atatgagtaa ^ ccatagttgc ctgactcccc 7800 

agtgaggcac ctatctcagc gatctgtcta "tcgttcat ^ | tgcaatgata 78 60 

glcgtgtaga taact^cgat --^^^^^H ^J|^eagcal taaaccagcc a?ccggaagg 7920 
ccgcgagacc cacgctcacc S^ctccagat ^tatcag ^^^^^at taattgttgc 7980 

gccgagcgca gaagtggtcc tgcaacttta ^ccgcc^cc ^ tgccattgct 8040 

Igglaagcta. gagtaagtag "cgccagtt ^-^^^^^J^ I^ttcIgcL cggttcccaa 8100 
acaggcatcg tggtgtcacg ctcgtcgttt ^g^a^gg aagcggttag ctccttcggt 8160 
cgatcaaggc gagttacatg -tcccocatg "gtgcaaaa ^ag^^^^^ ^^^^^^^^^^ 3220 
cctccgatcg ttgtcagaag taagttggcc ^^^^ tttctgtgac tggtgagtao 8280 

ctgcataatt ctcttactgt =^tgccatcc f^^^^^^ g"gc?c?tg cllggcgtca 8340 
tcaaccaagt cattctgaga atagtgtatg ^ggcgaccga 9 ^ ^ tggaaaacgt 8400 
atacgggata ataccgcgcc acatagcaga -^^Jaaaag ^gctc J? 3,,^^ 8460 

tcttcggggc gaaaactctc aaggatctta "^^^gttga ^ ^ ^^^^ t^gggtgagca 8520 
actcgtgcac ccaactgatc ttcagcatct """" cgacacggaa atgttgaata 8580 
aaaacaggaa ggcaaaatgc cg--aa^a5 ^gaataaggg -9--acgg ^J^^^ 
ctcatactct tcctttttca atattattga ^gcactra^c cacatttccc 8700 

ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccg g ^^^^ 



cgaaaagtgc 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: primer 

<400> 5 30 
atgggatcca agatgaagcg cgcaagaccg 



<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: primer 

<400> 6 30 
cataacgcgg ccgcttcttt attcttgggc 



<210> 7 
<211> 7148 
<212> DNA 

<213> Artificial Sequence 



^223> Description of Artificial Sequence: plasmid 

glcggaLgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
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ccgcatagtt aagccagtat ctgctccctg 
cgagcaaaat ttaagctaca acaaggcaag 
ttagggttag gcgttttgcg ctgcttcgcg 
gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggac tatttacggt 
atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gagctcggat ccaagatgaa gcgcgcaaga 
ccatatgaca cggaaaccgg tcctccaact 
cccaatgggt ttcaagagag tccccctggg 
gttacctcca atggcatgct tgcgctcaaa 
ggcaacctta cctcccaaaa tgtaaccact 
aacataaacc tggaaatatc tgcacccctc 
gccgccgcac ctctaatggt cgcgggcaac 
accgtgcacg actccaaact tagcattgcc 
aagctagccc tgcaaacatc aggccccctc 
actgcctcac cccctc.taac tactgccact 
atttatacac aaaatggaaa actaggacta 
gacctaaaca ctttgaccgt agcaactggt 
caaactaaag ttactggagc cttgggtttt 
gcaggaggac taaggattga ttctcaaaac 
tttgatgctc aaaaccaact aaatctaaga 
gcccacaact tggatattaa ctacaacaaa 
tccaaaaagc ttgaggttaa cctaagcact 
atagccatta atgcaggaga tgggcttgaa 
cccctcaaaa caaaaattgg ccatggccta 
aaactaggaa ctggccttag ttttgacagc 
aatgataagc taactttgtg gaccacacca 
gagaaagatg ctaaactcac tttggtctta 
gtttcagttt tggctgttaa aggcagtttg 
catcttatta taagatttga cgaaaatgga 
gaatattgga actttagaaa tggagatctt 
ggatttatgc ctaacctatc agcttatcca 
attgtcagtc aagtttactt aaacggagac 
ctaaacggta cacaggaaac aggagacaca 
tgggactggt ctggccacaa ctacattaat 
tcatacattg cccaagaata aagaagcggc 
ctatagtgtc acctaaatgc tagagctcgc 
cagccatctg ttgtttgccc ctcccccgtg 
actgtccttt cctaataaaa tgaggaaatt 
attctggggg gtggggtggg gcaggacagc 
catgctgggg atgcggtggg ctctatggct 
agggggtatc cccacgcgcc ctgtagcggc 
cgcagcgtga ccgctacact tgccagcgcc 
tcctttctcg ccacgttcgc cggctttccc 
gggttccgat ttagtgcttt acggcacctc 
tcacgtagtg ggccatcgcc ctgatagacg 
ttctttaata gtggactctt gttccaaact 
tcttttgatt tataagggat tttggggatt 
taacaaaaat ttaacgcgaa ttaattctgt 
ccccaggctc cccaggcagg cagaagtatg 
aggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccatagtccc gcccctaact 
tccgcccatt ctccgcccca tggctgacta 
gcctctgcct ctgagctatt ccagaagtag 



-5- 

cttgtgtgtt ggaggtcgct gagtagtgcg 120 
gcttgaccga caattgcatg aagaatctgc 180 
atgtacgggc cagatatacg cgttgacatt 240 
ttacggggtc attagttcat agcccatata 300 
atggcccgcc tggctgaccg cccaacgacc 360 
ttcccatagt aacgccaata gggactttcc 420 
aaactgccca cttggcagta catcaagtgt 480 
tcaatgacgg taaatggccc gcctggcatt 540 
ctacttggca gtacatctac gtattagtca 600 
agtacatcaa tgggcgtgga tagcggtttg 660 
ttgacgtcaa tgggagtttg ttttggcacc 720 
acaactccgc cccattgacg caaatgggcg 780 
gcagagctct ctggctaact agagaaccca 840 
ctcactatag ggagacccaa gcttggtacc 900 
ccgtctgaag ataccttcaa ccccgtgtat 960 
gtgccttttc ttactcctcc ctttgtatcc 1020 
gtactctctt tgcgcctatc cgaacctcta 1080 
atgggcaacg gcctctctct ggacgaggcc 114 0 
gtgagcccac ctctcaaaaa aaccaagtca 1200 
acagttacct cagaagccct aactgtggct 1260 
acactcacca tgcaatcaca ggccccgcta 1320 
acccaaggac ccctcacagt gtcagaagga 1380 
accaccaccg atagcagtac ccttactatc 144 0 
ggtagcttgg gcattgactt gaaagagccc 1500 
aagtacgggg ctcctttgca tgtaacagac 1560 
ccaggtgtga ctattaataa tacttccttg 1620 
gattcacaag gcaatatgca acttaatgta 1680 
agacgcctta tacttgatgt tagttatccg 174 0 
ctaggacagg gccctctttt tataaactca 1800 
ggcctttact tgtttacagc ttcaaacaat 1860 
gccaaggggt tgatgtttga cgctacagcc 1920 
tttggttcac ctaatgcacc aaacacaaat 1980 
gaatttgatt caaacaaggc tatggttcct 2040 
acaggtgcca ttacagtagg aaacaaaaat 2100 
gctccatctc ctaactgtag actaaatgca 2160 
acaaaatgtg gcagtcaaat acttgctaca 2220 
gctccaatat ctggaacagt tcaaagtgct 2280 
gtgctactaa acaattcctt cctggaccca 234 0 
actgaaggca cagcctatac aaacgctgtt 2400 
aaatctcacg gtaaaactgc caaaagtaac 24 60 
aaaactaaac ctgtaacact aaccattaca 2520 
actccaagtg catactctat gtcattttca 2580 
gaaatatttg ccacatcctc ttacactttt 2640 
cgctcgagca tgcatctaga gggccctatt 2700 
tgatcagcct cgactgtgcc ttctagttgc 2760 
ccttccttga ccctggaagg tgccactccc 2820 
gcatcgcatt gtctgagtag gtgtcattct 2880 
aagggggagg attgggaaga caatagcagg 2 940 
tctgaggcgg aaagaaccag ctggggctct 3000 
gcattaagcg cggcgggtgt ggtggttacg 3060' 
ctagcgcccg ctcctttcgc tttcttccct 3120 
cgtcaagctc taaatcgggg catcccttta 3180 
gaccccaaaa aacttgatta gggtgatggt 3240 
gtttttcgcc ctttgacgtt ggagtccacg 3300 
ggaacaacac tcaaccctat ctcggtctat 3360 
tcggcctatt ggttaaaaaa tgagctgatt 3420 
ggaatgtgtg tcagttaggg tgtggaaagt 3480 
caaagcatgc atctcaatta gtcagcaacc 3540 
ggcagaagta tgcaaagcat gcatctcaat 3600 
ccgcccatcc cgcccctaac tccgcccagt 3660 
atttttttta tttatgcaga ggccgaggcc 3720 
tgaggaggct tttttggagg cctaggcttt 3780 
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tgcaaaaagc tcccg.ga.c ttgtatatcc attttc a ct.atcaaga .-aggatga 3840 
ggatcgtttc gcatgattga acaagatgga "^^^^^^^ tgccgccgtg 3960 

gigaggctat toggctatga ctgggcacaa =^gacaatcg 9=^9=^^ g?ccggtgcc 4020 
Itccggctgt cagcgcaggg gcgcccggtt =tttttgtca H^mf ^ggcgttcct 4080 
ctgallgaac tgcaggacga ggcagcgcgg ^^atcgtggc tggccacga | ^^^^ 

tglgoagctg tgctcgacgt tf^^^ctgaa 'i^l^^^^^l ^=*|^^algt atclILatg 4200 
glgccggggc aggatctcct gtcatctcac ^ttgctcctg ^=^^9 cgaccaccaa 4260 

gctgatgcaa tgcggcggct gcatacgctt g^^ccggcta ==9 cgatcaggat 4320 

gcgaaacatc gcatcgagcg agcacgtact ^ggatggaag ^^99^ ^ 4330 
latctggacg aagagcatca ggs^^^cgcg -^^^^aao tgtt^g g ^^^^^3^,,^ 4440 
cgcatgcccg acggcgagga tctcgtcgtg a^ccarggcg ^ ^ tgtggcggac 4500 
a^ggtggaaa atggccgctt ttctggattc ^^cgactgtg ^^^^^ctggg ? |S ^^^^ 
cgllaicagg acatagcgtt ggctacccgt g^^attgctg ^^^^^^^^g | ^^.^ 4g20 

gitgaccgct; tcctcgtgct ttacggtatc g==9=^cccg ^"cgcagcg | ^gg^ 
Ltcgccttc ttgacgagtt -"ctgagcg gg-tctggg gttcgaaatg 9 ^ 
cgacgcccaa cctgccatca cgagatttcg ^"ccaccgc ^ gatctcatgc 4800 

gittcggaat cgttttccgg g^cgccggct W-tgatcct ^=^9=9 ^^g ^^^^^^^^^^ 4360 
tggagttctt cgcccacccc aacttgttta t^gcagctta ^ ^ tgtggtttgt 4 920 

aiagcatcac aaatttcaca aataaagcat ttttttcact ^^^^tctag 9 ^g y ^^^^ 
ccaLctcat caatgtatct tatcatgtct gtataccgtc g^cctctag ^^^^^^^^^l ^^^^ 
cgtaatcatg gtcatagctg tttcctgtgt ^^^^^tgtta tccgc ^^^^^ca 5100 

acatacgagc cggaagcata aagtgtaaag ^^^ggggtgc ^^aatgag g s^gO 
cattaattgc gttgogctca ctgcccgctt tccagtcggg HHf^^^^^ tcttccgctt 5220 
attaatgaat cggccaacgc gcggggagag gcggtttgcg ^attgggcg ^^^^ ^^BO 

cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gc^^^cggta 9 53^0 
caaaggcggt aatacggtta tccacagaat "g999^^aa jg^J^SJ ^ ^^^^^^J^^ ^^^^ 
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 9^^9 99 9 ^ggcgaaacc 54 60 
ggctccgccc ccctgacgag catcacaaaa atcgacgctc ^^S^cagagg ^^^0 
Igacaggact ataaagatac caggcgtttc ^^cctggaag =^ccctcg g 9 5530 
ttccgaccct gccgcttacc ggatacctgt ^cgcctttct ^^^^tcggg ^ ^ ^g^y ^^^^ 
tttctcaatg ctcacgctgt aggtatctca gtt^ggtgta ^^^^^ttcg 5^00 
gctgtgtgca cgaacccccc g^tcagcccg accgctgcgc cttatccggt ^.^^ 
ttgagtccaa cccggtaaga cacgacttat ^^^^^^^^gc cctaactacg 5820 

ttlgcagagc gaggtatgta ggcggtgcta <=agagttctt ^^^^^^gcgg ^ 5330 

gctlcactag aaggacagta "tggtatct g-^^^^^g ^gg^ajcggt ggtttt?ltg 5940 
aaagagttgg tagctcttga tccggcaaac ^^^-^^^^^c ^^^"g 99 atctttt 6000 

tttgcaagca gcagattacg cgcagaaaaa aaggatctca ^gaagatcct g g^gQ 
ctacggggtc tgacgctcag tggaacgaaa -tcacgtta agggattttg g^^^^^^'^^ ^^^^ 
tatcaaaaag gatcttcacc tagatccttt ^aaattaaaa I gaggcaccta 6180 

aaagtatata tgagtaaact tggtctgaca f ^a^caatg ^^taatcag g 99 ^^^^ ^24 0 
tctlagcgat ctgtctattt cgttcatcca tagttgcctg ^^Jj^^^^t ^ ^ ^^^^^ ^^^^ 
ctacgatacg ggagggctta ccatctggcc -^-^tgctgc ^^^gat g 9 | ^^^^ gggo 
gctcaccggc tccagattta tcagcaataa ^ccagccagc gaagctagag 6420 

gtggtcctgc aactttatcc gcctccatcc agtctattaa "gt g 99 « ^ ^gg 6480 
taagtagttc gccagttaat agtttgcgca acgttgttgc ""9^^^^^ ?caaggcgag 6540 
tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga tca^gg g^g ^^^^ 
ttacatgatc ccccatgttg tgcaaaaaag ^g^ttagctc ^"^^g ^g^aattctc 6660 

tcagaagtaa gttggccgca gtgttatcac tcatggttat gg--9^^^^9 ^^^^ .^^^ 
ttactgtcat gccatccgta agatgctttt ctgtgactgg ^9 9 cgggataata 6780 

tctga^aata gtgtatgcgg cgaccgagtt gctcttgccc gg-^tcaata cggg^^^^^^ 
ccgcgccaca tagcagaact ttaaaagtgc «=catcattgg ^ ^ cgtgcaccca 6900 

aaltctcaag gatcttaccg ctgttgagat «agttcgat g"acccac 9^ gggQ 
actgatcttc agcatctttt actttcacca g^9tttctgg 9tgagcaaaa 99^^99^ ^^^o 
aaaatgccgc aaaaaaggga ataagggcga «^9gaaatg ttgaat ^^^^.^^ttg 7080 

tttttcaata ttattgaagc atttatcagg 9ttattgtct =|^9 aaagtgccac 7140. 

aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccg y y ^^^^ 



ctgacgtc 



<210> 8 
<211> 7469 
<212> DNA 
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<213> Artificial sequence 

<220> . ^. Artificial Sequence: plasmid 

<223> Description of Artirxci 

■■■■■■I 

SiS aii Si HSi! Hfa SSiH I 
iiS HiHi Eirii iSS » H 

iiiiiiiiiiiiiii 

',^?s"Sc r,r-r.r,c 4jc..cc» .cot=cc.a. 1,^, , 
r,«r=*" ""-?r- i dbsi ^irdi?" 2 

''rff.'.?! «l"'cllt ^laSctr.: ^t^d,,., cct„,,«t tc.c.. 1980 
*rc'.««r= S«taat,t SSSr.! traSct.., acta„.=., 2100 

at.cttgat, ttagttatoc gtttgatgct actacaacaa aggcotttac 2160 

fgStSt.? tlllllT.ll "t?rag5tta aec«.a|C.c t,o=a.,,,, a2|0 

SSISI I^SU'r, o.»|==;« -rall?r, f£t|a^ 23J0 

fcr."ns ra"»a"i if — r» ^rc" «| 

aacagtcaaa tacttgctac ag^ttcagtt acgaaaatgg agtgctacta 2640 

r,=^ c"r,ig fa W is:f-n '.:r,s'aS? p 

r™:, ,e~| 2f-|f4l «'g, og=.?ta.,c 3360 

gaaagaacca gctggggctc taggggy 
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gcggcgggtg tggtggttac gcgcagcgtg 
gctcctttcg ctttcttccc ttcctttctc 
ctaaatcggg gcatcccttt agggttccga 
aaacttgatt agggtgatgg ttcacgtagt 
cctttgacgt tggagtccac gttctttaat 
ctcaacccta tctcggtcta ttcttttgat 
tggttaaaaa atgagctgat ttaacaaaaa 
gtcagttagg gtgtggaaag tccccaggct 
catctcaatt agtcagcaac caggtgtgga 
atgcaaagca tgcatctcaa ttagtcagca 
ccgcccctaa ctccgcccag ttccgcccat 
atttatgcag aggccgaggc cgcctctgcc 
ttttttggag gcctaggctt ttgcaaaaag 
tctgatcaag agacaggatg aggatcgttt 
ggttctccgg ccgcttgggt ggagaggcta 
ggctgctctg atgccgccgt gttccggctg 
aagaccgacc tgtccggtgc cctgaatgaa 
ctggccacga cgggcgttcc ttgcgcagct 
gactggctgc tattgggcga agtgccgggg 
gccgagaaag tatccatcat ggctgatgca 
acctgcccat tcgaccacca agcgaaacat 
gccggtcttg tcgatcagga tgatctggac 
ctgttcgcca ggctcaaggc gcgcatgccc 
gatgcctgct tgccgaatat catggtggaa 
ggccggctgg gtgtggcgga ccgctatcag 
gaagagcttg gcggcgaatg ggctgaccgc 
gattcgcagc gcatcgcctt ctatcgcctt 
ggttcgaaat gaccgaccaa gcgacgccca 
ccgccttcta tgaaaggttg ggcttcggaa 
tccagcgcgg ggatctcatg ctggagttct 
ataatggtta caaataaagc aatagcatca 
tgcattctag ttgtggtttg tccaaactca 
cgacctctag ctagagcttg gcgtaatcat 
atccgctcac aattccacac aacatacgag 
cctaatgagt gagctaactc acattaattg 
gaaacctgtc gtgccagctg cattaatgaa 
gtattgggcg ctcttccgct tcctcgctca 
ggcgagcggt atcagctcac tcaaaggcgg 
acgcaggaaa gaacatgtga gcaaaaggcc 
cgttgctggc gtttttccat aggctccgcc 
caagtcagag gtggcgaaac ccgacaggac 
gctccctcgt gcgctctcct gttccgaccc 
tcccttcggg aagcgtggcg ctttctcaa.t 
aggtcgttcg ctccaagctg ggctgtgtgc 
ccttatccgg taactatcgt cttgagtcca 
cagcagccac tggtaacagg attagcagag 
tgaagtggtg gcctaactac ggctacacta 
tgaagccagt taccttcgga aaaagagttg 
ctggtagcgg tggttttttt gtttgcaagc 
aagaagatcc tttgatcttt tctacggggt 
aagggatttt ggtcatgaga ttatcaaaaa 
aatgaagttt taaatcaatc taaagtatat 
gcttaatcag tgaggcacct atctcagcga 
gactccccgt cgtgtagata actacgatac 
caatgatacc gcgagaccca cgctcaccgg 
ccggaagggc cgagcgcaga agtggtcctg 
attgttgccg ggaagctaga gtaagtagtt 
ccattgctac aggcatcgtg gtgtcacgct 
gttcccaacg atcaaggcga gttacatgat 
ccttcggtcc tccgatcgtt gtcagaagta 
tggcagcact gcataattct cttactgtca 
gtgagtactc aaccaagtca ttctgagaat 



accgctacac ttgccagcgc cctagcgccc 3420 
gccacgttcg ccggctttcc ccgtcaagct 3480 
tttagtgctt tacggcacct cgaccccaaa 3540 
gggccatcgc cctgatagac ggtttttcgc 3600 
agtggactct tgttccaaac tggaacaaca 3660 
ttataaggga ttttggggat ttcggcctat 3720 
tttaacgcga attaattctg tggaatgtgt 3780 
ccccaggcag gcagaagtat gcaaagcatg 384 0 
aagtccccag gctccccagc aggcagaagt 3900 
accatagtcc cgcccctaac tccgcccatc 3960 
tctccgcccc atggctgact aatttttttt 4020 
tctgagctat tccagaagta gtgaggaggc 4080 
ctcccgggag cttgtatatc cattttcgga 4140 
cgcatgattg aacaagatgg attgcacgca 4200 
ttcggctatg actgggcaca acagacaatc 4260 
tcagcgcagg ggcgcccggt tctttttgtc 4320 
ctgcaggacg aggcagcgcg gctatcgtgg 4 380 
gtgctcgacg ttgtcactga agcgggaagg 44 40 
caggatctcc tgtcatctca ccttgctcct 4500 
atgcggcggc tgcatacgct tgatccggct 4560 
cgcatcgagc gagcacgtac tcggatggaa 4 620 
gaagagcatc aggggctcgc crccagccgaa 4 680 
gacggcgagg atctcgtcgt gacccatggc 4740 
aatggccgct tttctggatt catcgactgt 4800 
gacatagcgt tggctacccg tgatattgct 4860 
ttcctcgtgc tttacggtat cgccgctccc 4 920 
cttgacgagt tcttctgagc gggactctgg 4980 
acctgccatc acgagatttc gattccaccg 5040 
tcgttttccg ggacgccggc tggatgatcc 5100 
tcgcccaccc caacttgttt attgcagctt 5160 
caaatttcac aaataaagca tttttttcac 5220 
tcaatgtatc ttatcatgtc tgtataccgt 5280 
ggtcatagct gtttcctgtg tgaaattgtt 5340 
ccggaagcat aaagtgtaaa gcctggggtg 5400 
cgttgcgctc actgcccgct ttccagtcgg 54 60 
tcggccaacg cgcggggaga ggcggtttgc 5520 
ctgactcgct gcgctcggtc gttcggctgc 5580 
taatacggtt atccacagaa tcaggggata 5640 
agcaaaaggc caggaaccgt aaaaaggccg 5700 
cccctgacga gcatcacaaa aatcgacgct 5760 
tataaagata ccaggcgttt ccccctggaa 5820 
tgccgcttac cggatacctg tccgcctttc 5880 
gctcacgctg taggtatctc agttcggtgt 5940 
acgaaccccc cgttcagccc gaccgctgcg 6000 
acccggtaag acacgactta tcgccactgg 6060 
cgaggtatgt aggcggtgct acagagttct 6120 
gaaggacagt atttggtatc tgcgctctgc 6180 
gtagctcttg atccggcaaa caaaccaccg 624 0 
agcagattac gcgcagaaaa aaaggatctc 6300 
ctgacgctca gtggaacgaa aactcacgtt 6360 
ggatcttcac ctagatcctt ttaaattaaa 6420 
atgagtaaac ttggtctgac agttaccaat 64 80 
tctgtctatt tcgttcatcc atagttgcct 6540 
gggagggctt accatctggc cccagtgctg 6600 
ctccagattt atcagcaata aaccagccag 6660 
caactttatc cgcctccatc cagtctatta 6720 
cgccagttaa tagtttgcgc aacgttgttg 6780 
cgtcgtttgg tatggcttca ttcagctccg 6840 
cccccatgtt gtgcaaaaaa gcggttagct 6900 
agttggccgc agtgttatca ctcatggtta 6960 
tgccatccgt aagatgcttt tctgtgactg 7020 
agtgtatgcg gcgaccgagt tgctcttgcc 7080 
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cggcgtcaat 
gaaaacgttc 
tgtaacccac 
ggtgagcaaa 
gttgaatact 
tcatgagcgg 
catttccccg 



acgggataat 
ttcggggcga 
tcgtgcaccc 
aacaggaagg 
catactcttc 
atacatattt 
aaaagtgcca 



accgcgccac 
aaactctcaa 
aactgatctt 
caaaatigccg 
ctttttcaat 
gaatg^attt 
cctgacgtc 



atagcagaac 
ggatcttacc 
cagcatcttt 
caaaaaaggg 
attattgaag 
agaaaaataa 



t-ttaaaagtg 
gctgttgaga 
tactttcacc 
aataagggcg 
catttatcag 
acaaataggg 



ctcatcattg 714 0 

tccagttcga 7200 

agcgtttctg 7260 

acacggaaat 7320 

ggttattgtc 7380 

gttccgcgca 74 40 
7469 



<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

tgcttaagcg gccgcgaagg agaagtcc 



<210> 10 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 



<400> 10 

ccgagctagc gactgaaaat gag 



<210> 11 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: prxmer 



<400> 11 

cctctcgaga gacagcaaga cac 



<210> 12 
<211> 11152 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 12 

aagcttgggc agaaatggtt gaactcccga 

aaggggttgt ttcccaccaa ggacgacccg 

aaagacatat tcattctctg ctgcaaactt 

ggggaagttg cggttcgtgc tcgcagggct 

gtgcaagatt acaatctaaa caattcggag 

cagccaactt cctcttacaa gccgcatcga 

cttgctaaaa attatatttt taccaataag 

atgttaagaa atgaatcatt atcttttagt 



gagtgtccta cacctagggg agaagcagcc 60 

tctgcgcaca aacggatgag cccatcagac 120 

ggcatagctc tgctttgcct ggggctattg 180 

ctcacccttg actcttttaa tagctcttct 240 

aactcgacct tcctcctgag gcaaggacca 300 

ttttgtcctt cagaaataga aataagaatg 360 

accaatccaa taggtagatt attagttact 420 

actattttta ctcaaattca gaagttagaa 480 
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atgggaatag aaaatagaaa gagacgctca 
ttgaccacag gcctagaagt aaaaaaggga 
aggtggtggc aaccagggac ttatagggga 
ttaccatata caggaagata tgacttaaat 
aaagtgttat atagatccct cccttttcgt 
tgtatgttgt ctcaagaaga aaaagacgac 
ctaggaacag gaatgcactt ttggggaaag 
ggactaatag aacattattc tgcaaaaact 
ttggcccaac cttgcggttc ccagggctta 
aacgaggatg tgagacaagt ggtttcctga 
gctctgagtg ttctattttc ctatgttctt 
ttatgtaaac caagatataa aagagtgctg 
cattcacctc ttgtgtgttt gtgtctgttc 
tcactttcca gagggtcccc ccgcagaccc 
gctggcgccc gaacagggac cctcggataa 
tgtttgtctt gtattgtctc tttcttgtct 
ccatagggac caagctagcg actgaaaatg 
accgaagaaa tggccgccag tcttttggac 
cttccacctc ctagccattt tgaaccacct 
acggcccccg aagatcccaa cgaggaggcg 
ttggcggtgc aggaagggat tgacttactc 
ccgcctcacc tttcccggca gcccgagcag 
atgccaaacc ttgtaccgga ggtgatcgat 
agtgacgacg aggatgaaga gggtgaggag 
cacggttgca ggtcttgtca ttatcaccgg 
tcgctttgct atatgaggac ctgtggcatg 
gtgggtgata gagtggtggg tttggtgtgg 
ggtttaaaga attttgtatt gtgatttttt 
agcccgagcc agaaccggag cctgcaagac 
tcctgagacg cccgacatca cctgtgtcta 
actccggtcc ttctaacaca cctcctgaga 
aaccagttgc cgtgagagtt ggtgggcgtc 
ttaacgagcc tgggcaacct ttggacttga 
acctgtgatt gcgtgtgtgg ttaacgcctt 
taaagggtga gataatgttt aacttgcatg 
atataatgcg ccgtgggcta atcttggtta 
tggaagattt ttctgctgtg cgtaacttgc 
tttggaggtt tctgtggggc tcatcccagg 
acaagtggga atttgaagag cttttgaaat 
tgggtcacca ggcgcttttc caagagaagg 
ggcgcgctgc ggctgctgtt gcttttttga 
cccatctgag cggggggtac ctgctggatt 
tgagacacaa gaatcgcctg ctactgttgt 
aggagcagca gcagcagcag gaggaagcca 
acccgagagc cggcctggac cctcgggaat 
agaactgaga cgcattttga caattacaga 
ggagcggggg gctt<^gagg ctacagagga 
cagacaccgt cctgagtgta ttacttttca 
tgatctgctg gcgcagaagt attccataga 
ggatgatttt gaggaggcta ttagggtata 
gtacaagatc agcaaacttg taaatatcag 
cgaggtggag atagatacgg aggatagggt 
gccgggggtg cttggcatgg acggggtggt 
ttttagcggt acggttttcc tggccaatac 
tgggtttaac aatacctgtg tggaagcctg 
ttactgctgc tggaaggggg tggtgtgtcg 
cctctttgaa aggtgtacct tgggtatcct 
tgtggcctcc gactgtggtt gcttcatgct 
catggtatgt ggcaactgcg aggacagggc 
ctgtcacctg ctgaagacca ttcacgtagc 
tgagcataac atactgaccc gctgttcctt 
accttaccaa tgcaatttga gtcacactaa 
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acctcaattg aagaacaggt gcaaggacta 540 
aaaaagagtg tttttgtcaa aataggagac 600 
ccttacatct acagaccaac agatgccccc 660 
tgggataggt gggttacagt caatggctat 720 
gaaagactcg ccagagctag acctccttgg 780 
atgaaacaac aggtacatga ttatatttat 840 
attttccata ccaaggaggg gacagtggct 900 
catggcatga gttattatga atagccttta 960 
agtaagtttt tggttacaaa ctgttcttaa 1020 
cttggtttgg tatcaaaggt tctgatctga 1080 
ttggaattta tccaaatctt atgtaaatgc 1140 
attttttgag taaacttgca acagtcctaa 1200 
gccatcccgt ctccgctcgt cacttatcct 1260 
cggcgaccct caggtcggcc gactgcggca 1320 
gtgacccttg tctctatttc tactatttgg 1380 
ggctatcatc acaagagcgg aacggactca 1440 
agacatatta tctgccacgg aggtgttatt 1500 
cagctgatcg aagaggtact ggctgataat 1560 
acccttcacg aactgtatga tttagacgtg 1620 
gtttcgcaga tttttcccga ctctgtaatg 1680 
acttttccgc cggcgcccgg ttctccggag 1740 
ccggagcaga gagccttggg tccggtttct 1800 
cttacctgcc acgaggctgg ctttccaccc 18 60 
tttgtgttag attatgtgga gcaccccggg 1920 
aggaatacgg gggacccaga tattatgtgt 1980 
tttgtctaca gtaagtgaaa attatgggca 2040 
taattttttt tttaattttt acagttttgt 2100 
taaaaggtcc tgtgtctgaa cctgagcctg 2160 
ctacccgccg tcctaaaatg gcgcctgcta 2220 
gagaatgcaa tagtagtacg gatagctgtg 2280 
tacacccggt ggtcccgctg tgccccatta 2340 
gccaggctgt ggaatgtatc gaggacttgc 24 00 
gctgtaaacg ccccaggcca taaggtgtaa 24 60 
tgtttgctga atgagttgat gtaagtttaa 2520 
gcgtgttaaa tggggcgggg cttaaagggt 2580 
catctgacct catggaggct tgggagtgtt 2640 
tggaacagag ctctaacagt acctcttggt 2700 
caaagttagt ctgcagaatt aaggaggatt 27 60 
cctgtggtga gctgtttgat tctttgaatc 2820 
tcatcaagac tttggatttt tccacaccgg 2880 
gttttataaa ggataaatgg agcgaagaaa 2940 
ttctggccat gcatctgtgg agagcggttg 3000 
cttccgtccg cccggcgata ataccgacgg 3060 
ggcggcggcg gcaggagcag agcccatgga 3120 
gaatgttgta caggtggctg aactgtatcc 3180 
ggatgggcag gggctaaagg gggtaaagag 3240 
ggctaggaat ctagctttta gcttaatgac 3300 
acagatcaag gataattgcg ctaatgagct 3360 
gcagctgacc acttactggc tgcagccagg 3420 
tgcaaaggtg gcacttaggc cagattgcaa 34 80 
gaattgttgc tacatttctg ggaacggggc 3540 
ggcctttaga tgtagcatga taaatatgtg 3600 
tattatgaat gtaaggttta ctggccccaa 3660 
caaccttatc ctacacggtg taagcttcta 3720 
gaccgatgta agggttcggg gctgtgcctt 3780 
ccccaaaagc agggcttcaa ttaagaaatg 3840 
gtctgagggt aactccaggg tgcgccacaa 3900 
agtgaaaagc gtggctgtga ttaagcataa 3960 
ctctcagatg ctgacctgct cggacggcaa 4020 
cagccactct cgcaaggcct ggccagtgtt 4080 
gcatttgggt aacaggaggg gggtgttcct 4140 
gatattgctt gagcccgaga gcatgtccaa 4200 
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ggtgaacctg aacggggtgt ttgacatgac 
tgagacccgc accaggtgca gaccctgcga 
tgtgatgctg gatgtgaccg aggagctgag 
cgctgagttt ggctctagcg atgaagatac 
cttaagggtg ggaaagaata tataaggtgg 
agcagccgcc gccgccatga gcaccaactc 
gacaacgcgc atgcccccat gggccggggt 
tggtcgcccc gtcctgcccg caaactctac 
gccgttggag actgcagcct ccgccgccgc 
tgtgactgac tttgctttcc tgagcccgct 
ccgcgatgac aagttgacgg ctcttttggc 
tgtcgtttct cagcagctgt tggatctgcg 
ccctcccaat gcggtttaaa acataaataa 
gcaagtgtct tgctgtctct cgagggatct 
cataattgga caaactacct acagagattt 
gtgtataatg tgttaaacta ctgattctaa 
gaactgatga atgggagcag tggtggaatg 
aagaaatgcc atctagtgat gatgaggcta 
aaaagaagag aaaggtagaa gaccccaagg 
gtcatgctgt gtttagtaat agaactcttg 
aagctgcact gctatacaag aaaattatgg 
ataacagtta taatcataac atactgtttt 
ctattaataa ctatgctcaa aaattgtgta 
ataaggaata tttgatgtat agtgccttga 
gtagaggttt tacttgcttt aaaaaacctc 
atgaatgcaa ttgttgttgt taacttgttt 
aatagcatca caaatttcac aaataaagca 
tccaaactca tcaatgtatc ttatcatgtc 
gggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccaggtgtgg aaagtcccca 
atgcatctca attagtcagc aaccatagtc 
actccgccca gttccgccca ttctccgccc 
gaggccgagg ccgcctcggc ctctgagcta 
ggcctaggct tttgcaaaaa gcttggacac 
accactttat cccgcgtcag ggagaggcag 
actggttttt agtgcgccag atctctataa 
tttttaagcc gtagataaac aggctgggac 
ctgggacatg ttgcagatcc atgcacgtaa 
atggaaaggc attattgccg taagccgtgg 
tgaactgggt attcgtcatg tcgataccgt 
gcgcgagctt aaagtgctga aacgcgcaga 
tgacctggtg gataccggtg gtactgcggt 
ctttgtcacc atcttcgcaa aaccggctgg 
tatcccgcaa gatacctgga ttgaacagcc 
aatctccggt cgctaatctt ttcaacgcct 
caggcgggtt acaatagttt ccagtaagta 
acctgagcga aaccctgttc aaaccccgct 
gccgctttaa tcacggcgca caaccgcctg 
ctcactggta tcgcatgatt aaccgtctga 
aaatcctcga cgtccaggca cgtattgtga 
tatacgatac ggtgattggc taccgtggcg 
tttgtgaagg aaccttactt ctgtggtgtg 
taaagctcta aggtaaatat aaaattttta 
attgtttgtg tattttagat tccaacctat 
gcctttaatg aggaaaacct gttttgctca 
actgctgact ctcaacattc tactcctcca 
gactttcctt cagaattgct aagttttttg 
gcttgctttg ctatttacac cacaaaggaa 
gaaaaatatt ctgtaacctt tataagtagg 
tttcttactc cacacaggca tagagtgtct 
acctttagct ttttaatttg taaaggggtt 
actagagatc ataatcagcc ataccacatt 
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catgaagatc tggaaggtgc tgaggtacga 4260 
gtgtggcggt aaacatatta ggaaccagcc 4320 
gcccgatcac ttggtgctgg cctgcacccg 4380 
agattgaggt actgaaatgt gtgggcgtgg 444 0 
gggtcttatg tagttttgta tctgttttgc 4500 
gtttgatgga agcattgtga gctcatattt 45 60 
gcgtcagaat gtgatgggct ccagcattga 4 620 
taccttgacc tacgagaccg tgtctggaac 4 68 0 
ttcagccgct gcagccaccg cccgcgggat 47 4 0 
tgcaagcagt gcagcttccc gttcatccgc 4 800 
acaattggat tctttgaccc gggaacttaa 48 60 
ccagcaggtt tctgccctga aggcttcctc 4 920 
aaaaccagac tctgtttgga tttggatcaa 4980 
ttgtgaagga accttacttc tgtggtgtga 504 0 
aaagctctaa ggtaaatata aaatttttaa 5100 
ttgtttgtgt attttagatt ccaacctatg 5160 
cctttaatga ggaaaacctg ttttgctcag 5220 
ctgctgactc tcaacattct actcctccaa 528 0 
actttccttc agaattgcta agttttttga 5340 
cttgctttgc tatttacacc acaaaggaaa 54 00 
aaaaatattc tgtaaccttt ataagtaggc 54 60 
ttcttactcc acacaggcat ^gagtgtctg 5520 
cctttagctt tttaatttgt aaaggggtta 558 0 
ctagagatca taatcagcca taccacattt 5640 
ccacacctcc ccctgaacct gaaacataaa 5700 
attgcagctt ataatggtta caaataaagc 57 60 
tttttttcac tgcattctag ttgtggtttg 5820 
tggatccggc tgtggaatgt gtgtcagtta 5880 
ggcagaagta tgcaaagcat gcatctcaat 5 94 0 
ggctccccag caggcagaag tatgcaaagc 6000 
ccgcccctaa ctccgcccat cccgccccta 6060 
catggctgac taattttttt tatttatgca 6120 
ttccagaagt agtgaggagg cttttttgga 6180 
aagacaggct tgcgagatat gtttgagaat 624 0 
tgcgtaaaaa gacgcggact catgtgaaat 6300 
tctcgcgcaa cctattttcc cctcgaacac 6360 
acttcacatg agcgaaaaat acatcgtcac 6420 
actcgcaagc cgactgatgc cttctgaaca 6480 
cggtctggta ccgggtgcgt tactggcgcg 654 0 
ttgtatttcc agctacgatc acgacaacca 6600 
aggcgatggc gaaggcttca tcgttattga 6660 
tgcgattcgt gaaatgtatc caaaagcgca 6720 
tcgtccgctg gttgatgact atgttgttga 6780 
gtgggatatg ggcgtcgtat tcgtcccgcc 6840 
ggcactgccg ggcgttgttc tttttaactt 6900 
ttctggaggc tgcatccatg acacaggcaa 6960 
ttaaacatcc tgaaacctcg acgctagtcc 7020 
tgcagtcggc ccttgatggt aaaaccatcc 7080 
tgtggatctg gcgcggcatt gacccacgcg 714 0 
tgagcgatgc cgaacgtacc gacgatgatt 7200 
gcaactggat ttatgagtgg gccccggatc 7260 
acataattgg acaaactacc tacagagatt 7320 
agtgtataat gtgttaaact actgattcta 7380 
ggaactgatg aatgggagca gtggtggaat 7 440 
gaagaaatgc catctagtga tgatgaggct 7500 
aaaaagaaga gaaaggtaga agaccccaag 7560 
agtcatgctg tgtttagtaa tagaactctt 7 620 
aaagctgcac tgctatacaa gaaaattatg 7680 
cataacagtt ataatcataa catactgttt 7740 
gctattaata actatgctca aaaattgtgt 7800 
aataaggaat atttgatgta tagtgccttg 7860 
tgtagaggtt ttacttgctt taaaaaacct 7 920 
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cccacacctc cccctgaacc tgaaacataa aatgaatgca attgttgttg ttaacttgtt 7980 
tattgcagct tataatggtt acaaataaag caatagcatc acaaatttca caaataaagc 8040 
atttttttca ctgcattcta gttgtggttt gtccaaactc atcaatgtat cttatcatgt 8100 
ctggatcccc aggaagctcc tctgtgtcct cataaaccct aacctcctct acttgagagg 8160 
acattccaat cataggctgc ccatccaccc tctgtgtcct cctgttaatt aggtcactta 8220 
acaaaaagga aattgggtag gggtttttca cagaccgctt tctaagggta attttaaaat 8280 
atctgggaag tcccttccac tgctgtgttc cagaagtgtt ggtaaacagc ccacaaatgt 8340 
caacagcaga aacatacaag ctgtcagctt tgcacaaggg cccaacaccc tgctcatcaa 8400 
gaagcactgt ggttgctgtg ttagtaatgt gcaaaacagg aggcacattt tccccacctg 8460 
tgtaggttcc aaaatatcta gtgttttcat ttttacttgg atcaggaacc cagcactcca 8520 
ctggataagc attatcctta tccaaaacag ccttgtggtc agtgttcatc tgctgactgt 8580 
caactgtagc attttttggg gttacagttt gagcaggata tttggtcctg tagtttgcta 8 640 
acacaccctg cagctccaaa ggttccccac caacagcaaa aaaatgaaaa tttgaccctt 8700 
gaatgggttt tccagcacca ttttcatgag ttttttgtgt ccctgaatgc aagtttaaca 8 7 60 
tagcagttac cccaataacc tcagttttaa cagtaacagc ttcccacatc aaaatatttc 8820 
cacaggttaa gtcctcattt aaattaggca aaggaattct tgaagacgaa agggcctcgt 8880 
gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga cgtcaggtgg 8940 
cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa tacattcaaa 9000 
tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt gaaaaaggaa 9060 
gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg cattttgcct 9120 
tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag atcagttggg 9180 
tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg agagttttcg 9240 
ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg gcgcggtatt 9300 
atcccgtgtt gacgccgggc aagagcaact cggtcgccgc atacactatt ctcagaatga 9360 
cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga cagtaagaga 9420 
attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac ttctgacaac 9480 
gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc atgtaactcg 9540 
ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc gtgacaccac 9600 
gatgcctgca gcaatggcaa caacgttgcg caaactatta actggcgaac tacttactct 9660 
agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag gaccacttct 9720 
gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg gtgagcgtgg 9780 
gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta tcgtagttat 9840 
ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg ctgagatagg 9900 
tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata tactttagat 9960 
tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt ttgataatct 10020 
catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 10080 
gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 1014 0 
aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 10200 
gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 10260 
gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 10320 
gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 10380 
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 10440 
cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 10500 
cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 10560 
agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 10620 
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 10680 
gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 1074 0 
catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 10800 
agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 10860 
ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 10920 
atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt atacactccg 10980 
ctatcgctac gtgactgggt catggctgcg ccccgacacc cgccaacacc cgctgacgcg 1104 0 
ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg 11100 
agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgaggca gc 11152 



<210> 13 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: primer 

<400> 13 19 
gacggatcgg gagatctcc 

<210> 14 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 14 22 
ccgcctcaga agccatagag cc 

<210> 15 
<211> 14455 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: plasmid 

aagcttgggc agaaatggtt gaactoccga gagtgtccta cacctagggg agaagcagcc 60 
lllTaaltll ttcccaccaa ggacgaoccg tctgcgcaca aacggatgag cccatcagac 120 
llllllllTt tcattctctg Itgcaaactt ggcatagctc tgctttgcct ^^^^ctattg 10 
aaaaaaattg cggttcgtgc tcgcagggct ctcacccttg actcttttaa tagctcttct z«u 
g?gcaagat? acaatcLL caattcggag aactcgacct tcctcctgag gcaaggacca 300 
llTooltclt cctcttacaa gccgcatcga ttttgtcctt cagaaataga -taagaatg 350 

Tirr.: llllTclll at^ttSag? Tct^t ttt^ ctS^^tca lllllll^tl til 
llllllllll Slatag^aa gagacgct^a acctcaattg aagaacaggt gcaaggacta 540 
ttaaccacaq gcctagaagt aaaaaaggga aaaaagagtg tttttgtcaa aataggagac 600 
iggtggtgjc aaccagggao ttatagggga ccttacatct acagaccaac agatgccccc 660 
ttaccatlta caggaagata tgacttaaat tgggataggt gggttacagt caatggctat 720 
iaagtjttat atagatcoct cccttttcgt gaaagactcg ccagagctag ---tccttgg 780 
tgtatgttgt ctcaagaaga aaaagacgac atgaaacaac aggtacatga ttatatttat B40 
cLggaacIg gaatgcactt ttggggaaag attttcoata =="39^359 g^^^^^^^ct 900 
ggaclaatag aacattattc tgcaaaaact catggcatga gttattatga ^tagccttta 960 
?tggcccaac ottgcggttc ccagggotta agtaagtttt tggttacaaa ^tgttcttaa 1020 
aacaaaaatg tgagacaagt ggtttcctga cttggtttgg tatcaaaggt tctgatctga 1080 
gctctglgtg t?c?attt?c cLtgttctt ttggaattta tccaaatctt ^tgtaaatgc 1140 
?tatg?aaac caagatataa aagagtgctg attttttgag taaacttgca ^"gtcctaa 1200 
cattcacctc ttgtgtgttt gtgtctgttc gccatcccgt ctccgctcgt cacttatcct 1260 
tcac"tcca g^g^tLcc L^cagaccc cggcgaccct caggtcggcc ^-tgcggca 1320 

«ctt llltllllll tftc^Sct llltltllll llllllllT. a^c g t 14J0 

™ra ggSc^^g clg^ra"^ 

cttccacctc c?Igccatt? tgaaccacct acccttcacg aactgtatga tttagacgtg 1620 

aoggcccccg aagltcccaa cgaggaggcg gtttcgcaga tttttcccga ctctgtaatg 1680 

ttggcggtgc aggaagggat tgacttactc acttttccgc cggcgcccqg "ctccggag 1740. 

ccalctcacc tttcccggca gcccgagcag ccggagcaga gagccttggg tccggtttct 1800 

atqccaaacc ttgtaccgga ggtgatcgat cttacctgcc acgaggctgg ctttccaccc 1860 

'J b sbs • 'Ht-- 
^^h^ tinii^^ 
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ggtttaaaga attttgtatt gtgatttttt 
agcccgagcc agaaccggag cctgcaagac 
tcctgagacg cccgacatca cctgtgtcta 
actccggtcc ttctaacaca cctcctgaga 
aaccagttgc cgtgagagtt ggtgggcgtc 
ttaacgagcc tgggcaacct ttggacttga 
acctgtgatt gcgtgtgtgg ttaacgcctt 
taaagggtga gataatgttt aacttgcatg 
atataatgcg ccgtgggcta atcttggtta 
tggaagattt ttctgctgtg cgtaacttgc 
tttggaggtt tctgtggggc tcatcccagg 
acaagtggga atttgaagag cttttgaaat 
tgggtcacca ggcgcttttc caagagaagg 
ggcgcgctg? ggctgctgtt gcttttttga 
cccatctgag cggggggtac ctgctggatt 
tgagacacaa gaatcgcctg ctactgttgt 
aggagcagca gcagcagcag gaggaagcca 
acccgagagc cggcctggac cctcgggaat 
agaactgaga cgcattttga caattacaga 
ggagcggggg gcttgtgagg ctacagagga 
cagacaccgt cctgagtgta ttacttttca 
tgatctgctg gcgcagaagt attccataga 
ggatgatttt gaggaggcta ttagggtata 
gtacaagatc agcaaacttg taaatatcag 
cgaggtggag atagatacgg aggatagggt 
gccgggggtg cttggcatgg acggggtggt 
ttttagcggt acggttttcc tggccaatac 
tgggtttaac aatacctgtg tggaagcctg 
ttactgctgc tggaaggggg tggtgtgtcg 
cctctttgaa aggtgtacct tgggtatcct 
tgtggcctcc gactgtggtt gcttcatgct 
catggtatgt ggcaactgcg aggacagggc 
ctgtcacctg ctgaagacca ttcacgtagc 
tgagcataac atactgaccc gctgttcctt 
accttaccaa tgcaatttga gtcacactaa 
ggtgaacctg aacggggtgt ttgacatgac 
tgagacccgc accaggtgca gaccctgcga 
tgtgatgctg gatgtgaccg aggagctgag 
cgctgagttt ggctctagcg atgaagatac 
cttaagggtg ggaaagaata tataaggtgg 
agcagccgcc gccgccatga gcaccaactc 
gacaacgcgc atgcccccat gggccggggt 
tggtcgcccc gtcctgcccg caaactctac 
gccgttggag actgcagcct ccgccgccgc 
tgtgactgac tttgctttcc tgagcccgct 
ccgcgatgac aagttgacgg ctcttttggc 
tgtcgtttct cagcagctgt tggatctgcg 
ccctcccaat gcggtttaaa acataaataa 
gcaagtgtct tgctgtctct cgagggatct 
cataattgga caaactacct acagagattt 
gtgtataatg tgttaaacta ctgattctaa 
gaactgatga atgggagcag tggtggaatg 
aagaaatgcc atctagtgat gatgaggcta 
aaaagaagag aaaggtagaa gaccccaagg 
gtcatgctgt gtttagtaat agaactcttg 
aagctgcact gctatacaag aaaattatgg 
ataacagtta taatcataac atactgtttt 
ctattaataa ctatgctcaa aaattgtgta 
ataaggaata tttgatgtat agtgccttga 
gtagaggttt tacttgcttt aaaaaacctc 
atgaatgcaa ttgttgttgt taacttgttt 
aatagcatca caaatttcac aaataaagca 
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taaaaggtcc tgtgtctgaa cctgagcctg 2160 
ctacccgccg tcctaaaatg gcgcctgcta 2220 
gagaatgcaa tagtagtacg gatagctgtg 2280 
tacacccggt ggtcccgctg tgccccatta 2340 
gccaggctgt ggaatgtatc gaggacttgc 2400 
gctgtaaacg ccccaggcca taaggtgtaa 24 60 
tgtttgctga atgagttgat gtaagtttaa 2520 
gcgtgttaaa tggggcgggg cttaaagggt 2580 
catctgacct catggaggct tgggagtgtt 2640 
tggaacagag ctctaacagt acctcttggt 2700 
caaagttagt ctgcagaatt aaggaggatt 2760 
cctgtggtga gctgtttgat tctttgaatc 2820 
tcatcaagac tttggatttt tccacaccgg 2880 
gttttataaa ggataaatgg agcgaagaaa 2940 
ttctggccat gcatctgtgg agagcggttg 3000 
cttccgtccg cccggcgata ataccgacgg 3060 
ggcggcggcg gcaggagcag agcccatgga 3120 
gaatgttgta caggtggctg aactgtatcc 3180 
ggatgggcag gggctaaagg gggtaaagag 3240 
ggctaggaat ctagctttta gcttaatgac 3300 
acagatcaag gataattgcg ctaatgagct 3360 
gcagctgacc acttactggc tgcagccagg 3420 
tgcaaaggtg gcacttaggc cagattgcaa 34 60 
gaattgttgc tacatttctg ggaacggggc 3540 
ggcctttaga tgtagcatga taaatatgtg 3600 
tattatgaat gtaaggttta ctggccccaa 3660 
caaccttatc ctacacggtg taagcttcta 3720 
gaccgatgta agggttcggg gctgtgcctt 3780 
ccccaaaagc agggcttcaa ttaagaaatg 3840 
gtctgagggt aactccaggg tgcgccacaa 3 900 
agtgaaaagc gtggctgtga ttaagcataa 3960 
ctctcagatg ctgacctgct cggacggcaa 4020 
cagccactct cgcaaggcct ggccagtgtt 4080 
gcatttgggt aacaggaggg gggtgttcct 4140 
gatattgctt gagcccgaga gcatgtccaa 4200 
catgaagatc tggaaggtgc tgaggtacga 4260 
gtgtggcggt aaacatatta ggaaccagcc 4320 
gcccgatcac ttggtgctgg cctgcacccg 4380 
agattgaggt actgaaatgt gtgggcgtgg 4440 
gggtcttatg tagttttgta tctgttttgc 4500 
gtttgatgga agcattgtga gctcatattt 4560 
gcgtcagaat gtgatgggct ccagcattga 4 620 
taccttgacc tacgagaccg tgtctggaac 4 680 
ttcagccgct gcagccaccg cccgcgggat 4 740 
tgcaagcagt gcagcttccc gttcatccgc 4800 
acaattggat tctttgaccc gggaacttaa 4860 
ccagcaggtt tctgccctga aggcttcctc 4 920 
aaaaccagac tctgtttgga tttggatcaa 4980 
ttgtgaagga accttacttc tgtggtgtga 5040 
aaagctctaa ggtaaatata aaatttttaa 5100 
ttgtttgtgt attttagatt ccaacctatg 5160 
cctttaatga ggaaaacctg ttttgctcag 5220 
ctgctgactc tcaacattct actcctccaa 5280 
actttccttc agaattgcta agttttttga 5340 
cttgctttgc tatttacacc acaaaggaaa 5400 
aaaaatattc tgtaaccttt ataagtaggc 5460. 
ttcttactcc acacaggcat agagtgtctg 5520 
cctttagctt tttaatttgt aaaggggtta 5580 
ctagagatca taatcagcca taccacattt 5640 
ccacacctcc ccctgaacct gaaacataaa 5700 
attgcagctt ataatggtta caaataaagc 5760 
tttttttcac tgcattctag ttgtggtttg 5820 
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tccaaactca tcaatgtatc ttatcatgtc 
gggtgtggaa agtccccagg ctccccagca 
tagtcagcaa ccaggtgtgg aaagtcccca 
atgcatctca attagtcagc aaccatagtc 
actccgccca gttccgccca ttctccgccc 
gaggccgagg ccgcctcggc ctctgagcta 
ggcctaggct tttgcaaaaa gcttggacac 
accactttat cccgcgtcag ggagaggcag 
actggttttt agtgcgccag atctctataa 
tttttaagcc gtagataaac aggctgggac 
ctgggacatg ttgcagatcc atgcacgtaa 
atggaaaggc attattgccg taagccgtgg 
tgaactgggt attcgtcatg tcgataccgt 
gcgcgagctt aaagtgctga aacgcgcaga 
tgacctggt^ gataccggtg gtactgcggt 
ctttgtcacc atcttcgcaa aaccggctgg 
tatcccgcaa gatacctgga ttgaacagcc 
aatctccggt cgctaatctt ttcaacgcct 
caggcgggtt acaatagttt ccagtaagta 
acctgagcga aaccctgttc aaaccccgct 
gccgctttaa tcacggcgca caaccgcctg 
ctcactggta tcgcatgatt aaccgtctga 
aaatcctcga cgtccaggca cgtattgtga 
tatacgatac ggtgattggc taccgtggcg 
tttgtgaagg aaccttactt ctgtggtgtg 
taaagctcta aggtaaatat aaaattttta 
attgtttgtg tattttagat tccaacctat 
gcctttaatg aggaaaacct gttttgctca 
actgctgact ctcaacattc tactcctcca 
gactttcctt cagaattgct aagttttttg 
gcttgctttg ctatttacac cacaaaggaa 
gaaaaatatt ctgtaacctt tataagtagg 
tttcttactc cacacaggca tagagtgtct 
acctttagct ttttaatttg taaaggggtt 
actagagatc ataatcagcc ataccacatt 
cccacacctc cccctgaacc tgaaacataa 
tattgcagct tataatggtt acaaataaag 
atttttttca ctgcattcta gttgtggttt 
ctggatcccc aggaagctcc tctgtgtcct 
acattccaat cataggctgc ccatccaccc 
acaaaaagga aattgggtag gggtttttca 
atctgggaag tcccttccac tgctgtgttc 
caacagcaga aacatacaag ctgtcagctt 
gaagcactgt ggttgctgtg ttagtaatgt 
tgtaggttcc aaaatatcta gtgttttcat 
ctggataagc attatcctta tccaaaacag 
caactgtagc attttttggg gttacagttt 
acacaccctg cagctccaaa ggttccccac 
gaatgggttt tccagcacca ttttcatgag 
tagcagttac cccaataacc tcagttttaa 
cacaggttaa gtcctcattt aaattaggca 
gatacgccta tttttatagg ttaatgtcat 
cacttttcgg ggaaatgtgc gcggaacccc 
tatgtatccg ctcatgagac aataaccctg 
gagtatgagt attcaacatt tccgtgtcgc 
tcctgttttt gctcacccag aaacgctggt 
tgcacgagtg ggttacatcg aactggatct 
ccccgaagaa cgttttccaa tgatgagcac 
atcccgtgtt gacgccgggc aagagcaact 
cttggttgag tactcaccag tcacagaaaa 
attatgcagt gctgccataa ccatgagtga 
gatcggagga ccgaaggagc taaccgcttt 



tggatccggc 
ggcagaagta 
ggctccccag 
ccgcccctaa 
catggctgac 
ttccagaagt 
aagacaggct 
tgcgtaaaaa 
tctcgcgcaa 
acttcacatg 
actcgcaagc 
cggtctggta 
ttgtatttcc 
aggcgatggc 
tgcgattcgt 
tcgtccgctg 
gtgggatatg 
ggcactgccg 
ttctggaggc 
ttaaacatcc 
tgcagtcggc 
tgtggatctg 
tgagcgatgc 
gcaactggat 
acataattgg 
agtgtataat 
ggaactgatg 
gaagaaatgc 
aaaaagaaga 
agtcatgctg 
aaagctgcac 
cataacagtt 
gctattaata 
aataaggaat 
tgtagaggtt 
aatgaatgca 
caatagcatc 
gtccaaactc 
cataaaccct 
tctgtgtcct 
cagaccgctt 
cagaagtgtt 
tgcacaaggg 
gcaaaacagg 
ttttacttgg 
ccttgtggtc 
gagcaggata 
caacagcaaa 
ttttttgtgt 
cagtaacagc 
aaggaattct 
gataataatg 
tatttgttta 
ataaatgctt 
ccttattccc 
gaaagtaaaa 
caacagcggt 
ttttaaagtt 
cggtcgccgc 
gcatcttacg 
taacactgcg 
tttgcacaac 



tgtggaatgt 

tgcaaagcat 

caggcagaag 

ctccgcccat 

taattttttt 

agtgaggagg 

tgcgagatat 

gacgcggact 

cctattttcc 

agcgaaaaat 

cgactgatgc 

ccgggtgcgt 

agctacgatc 

gaaggcttca 

gaaatgtatc 

gttgatgact 

ggcgtcgtat 

ggcgttgttc 

tgcatccatg 

tgaaacctcg 

ccttgatggt 

gcgcggcatt 

cgaacgtacc 

ttatgagtgg 

acaaactacc 

gtgttaaact 

aatgggagca 

catctagtga 

gaaaggtaga 

tgtttagtaa 

tgctatacaa 

ataatcataa 

actatgctca 

atttgatgta 

ttacttgctt 

attgttgttg 

acaaatttca 

atcaatgtat 

aacctcctct 

cctgttaatt 

tctaagggta 

ggtaaacagc 

cccaacaccc 

aggcacattt 

atcaggaacc 

agtgttcatc 

tttggtcctg 

aaaatgaaaa 

ccctgaatgc 

ttcccacatc 

tgaagacgaa 

gtttcttaga 

tttttctaaa 

caataatatt 

ttttttgcgg 

gatgctgaag 

aagatccttg 

ctgctatgtg 

atacactatt 

gatggcatga 

gccaacttac 

atgggggatc 



gtgtcagtta 

gcatctcaat 

tatgcaaagc 

cccgccccta 

tatttatgca 

cttttttgga 

gtttgagaat 

catgtgaaat 

cctcgaacac 

acatcgtcac 

cttctgaaca 

tactggcgcg 

acgacaacca 

tcgttattga 

caaaagcgca 

atgttgttga 

tcgtcccgcc 

tttttaactt 

acacaggcaa 

acgctagtcc 

aaaaccatcc 

gacccacgcg 

gacgatgatt 

gccccggatc 

tacagagatt 

actgattcta 

gtggtggaat 

tgatgaggct 

agaccccaag 

tagaactctt 

gaaaattatg 

catactgttt 

aaaattgtgt 

tagtgccttg 

taaaaaacct 

ttaacttgtt 

caaataaagc 

cttatcatgt 

acttgagagg 

aggtcactta 

attttaaaat 

ccacaaatgt 

tgctcatcaa 

tccccacctg 

cagcactcca 

tgctgactgt 

tagtttgcta 

tttgaccctt 

aagtttaaca 

aaaatatttc 

agggcctcgt 

cgtcaggtgg 

tacattcaaa 

gaaaaaggaa 

cattttgcct 

atcagttggg 

agagttttcg 

gcgcggtatt 

ctcagaatga 

cagtaagaga 

ttctgacaac 

atgtaactcg 



5880 

5940 

6000 

60 60 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 . 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 
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ccttgatcgt tgggaaccgg agctgaatga 
gatgcctgca gcaatggcaa caacgttgcg 
agcttcccgg caacaattaa tagactggat 
gcgctcggcc cttccggctg gctggtttat 
gtctcgcggt atcattgcag cactggggcc 
ctacacgacg gggagtcagg caactatgga 
tgcctcactg attaagcatt ggtaactgtc 
tgatttaaaa cttcattttt aatttaaaag 
catgaccaaa atcccttaac gtgagttttc 
gatcaaagga tcttcttgag atcctttttt 
aaaaccaccg ctaccagcgg tggtttgttt 
gaaggtaact ggcttcagca gagcgcagat 
gttaggccac cacttcaaga actctgtagc 
gttaccagtg gctgctgcca gtggcgataa 
atagttaccg gataaggcgc agcggtcggg 
cttggagcga acgacctaca ccgaactgag 
cacgcttccc gaagggagaa aggcggacag 
agagcgcacg agggagcttc cagggggaaa 
tcgccacctc tgacttgagc gtcgattttt 
gaaaaacgcc agcaacgcgg cctttttacg 
catgttcttt cctgcgttat cccctgattc 
agctgatacc gctcgccgca gccgaacgac 
ggaagagcgc ctgatgcggt attttctcct 
accgcctcag aagccataga gcccaccgca 
tcctccccct tgctgtcctg ccccacccca 
caatgcgatg caatttcctc attttattag 
gtcaaggaag gcacggggga ggggcaaaca 
gaggctgatc agcgagctct agcatttagg 
atgctcgagc ggccgcttct ttattcttgg 
gcaaatattt cattaatgta gttgtggcca 
gcacttggag ttgtgtctcc tgtttcctgt 
ggtttagttt tgtctccgtt taagtaaact 
ccgtgagatt ttggataagc tgataggtta 
gtgccttcag taagatctcc atttctaaag 
tttagtagca ctccattttc gtcaaatctt 
gatattggag ccaaactgcc tttaacagcc 
ccacattttg ttaagaccaa agtgagttta 
ggagatggag ctggtgtggt ccacaaagtt 
atggcacctg tgctgtcaaa actaaggcca 
gaatcaaatt ctaggccatg gccaattttt 
ggtgaaccaa attcaagccc atctcctgca 
aaccccttgg cagtgcttag gttaacctca 
aagtaaaggc ctttgttgta gttaatatcc 
ccctgtccta gtcttagatt tagttggttt 
ataaggcgtc tgttttgaga atcaatcctt 
ccttgtgaat caaaacccaa ggctccagta 
gtcacacctg gaccagttgc tacggtcaaa 
gccccgtact ttagtcctag ttttccattt 
cccaagctac cagtggcagt agttagaggg 
tcggtggtgg tgagggggcc tgatgtttgc 
ggtccttggg tggcaatgct aagtttggag 
atggtgagtg tgttgcccgc gaccattaga 
gaggtaactg tgaggggtgc agatatttcc 
ggtgggctca cagtggttac attttgggag 
ccgttgccca ttttgagcgc aagcatgcca 
aaagagagta ccccaggggg actctcttga 
agaaaaggca cagttggagg accggtttcc 
tcttcagacg gtcttgcgcg cttcatcttg 
catccgccgt ctcaagaccg cctactttaa 
caaccccgac cgccacccgc tgccgcccgc 
ctggttagac gcctttctcg agaggttttc 
cctcggtggc ggagtaccgt tcggaggccg 
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agccatacca aacgacgagc gtgacaccac 9600 
caaactatta actggcgaac tacttactct 9660 
ggaggcggat aaagttgcag gaccacttct 9720 
tgctgataaa tctggagccg gtgagcgtgg 9780 
agatggtaag ccctcccgta tcgtagttat 9840 
tgaacgaaat agacagatcg ctgagatagg 9900 
agaccaagtt tactcatata tactttagat 9960 
gatctaggtg aagatccttt ttgataatct 10020 
gttccactga gcgtcagacc ccgtagaaaa 10080 
tctgcgcgta atctgctgct tgcaaacaaa 10140 
gccggatcaa gagctaccaa ctctttttcc 10200 
accaaatact gtccttctag tgtagccgta 10260 
accgcctaca tacctcgctc tgctaatcct 10320 
gtcgtgtctt accgggttgg actcaagacg 10380 
ctgaacgggg ggttcgtgca cacagcccag 10440 
atacctacag cgtgagctat gagaaagcgc 10500 
gtatccggta agcggcaggg tcggaacagg 10560 
cgcctggtat ctttatagtc ctgtcgggtt 10620 
gtgatgctcg tcaggggggc ggagcctatg 1068O 
gttcctggcc ttttgctggc cttttgctca 10740 
tgtggataac cgtattaccg cctttgagtg lOBOO 
cgagcgcagc gagtcagtga qcgaggaagc 108 60 
tacgcatctg tgcggtattt cacaccgcat 10920 
tccccagcat gcctgctatt gtcttcccaa 10980 
ccccccagaa tagaatgaca cctactcaga 11040 
gaaaggacag tgggagtggc accttccagg 11100 
acagatggct ggcaactaga aggcacagtc 11160 
tgacactata gaatagggcc ctctagatgc 11220 
gcaatgtatg aaaaagtgta agaggatgtg 11280 
gaccagtccc atgaaaatga catagagtat 11340 
gtaccgttta gtgtaatggt tagtgttaca 11400 
tgactgacaa tgttactttt ggcagtttta 114 60 
ggcataaatc caacagcgtt tgtataggct 11520 
ttccaatatt ctgggtccag gaaggaattg 11580 
ataataagat gagcactttg aactgttcca 11640 
aaaactgaaa ctgtagcaag tatttgactg 11700 
gcatctttct ctgcatttag tctacagtta 11760 
agcttatcat tatttttgtt tcctactgta 11820 
gttcctagtt taggaaccat agccttgttt 11880 
gttttgaggg gatttgtgtt tggtgcatta 11940 
ttaatggcta tggctgtagc gtcaaacatc 12000 
agctttttgg aattgtttga agctgtaaac 12060 
aagttgtggg ctgagtttat aaaaagaggg 12120 
tgagcatcaa acggataact aacatcaagt 12180 
agtcctcctg ctacattaag ttgcatattg 12240 
actttagttt gcaaggaagt attattaata 12300 
gtgtttaggt cgtctgttac atgcaaagga 12360 
tgtgtataaa tgggctcttt caagtcaatg 12420 
ggtgaggcag tgatagtaag ggtactgcta 12480 
agggctagct ttccttctga cactgtgagg 1254 0 
tcgtgcacgg ttagcggggc ctgtgattgc 12600 
ggtgcggcgg cagccacagt tagggcttct 12660 
aggtttatgt ttgacttggt ttttttgaga 12720 
gtaaggttgc cggcctcgtc cagagagagg 12780 
ttggaggtaa ctagaggttc ggataggcgc 12840 
aacccattgg gggatacaaa gggaggagta 12900 
gtgtcatatg gatacacggg gttgaaggta 12960 
gatctcaagc ctgccacacc tcacctcgac 13020 
ttacatcatc agcagcacct ccgccagaaa 13080 
cacggtgctc agcctacctt gcgactgtga 13140 
cgatccggtc gatgcggact cgctcaggtc 13200 
acgggtttcc gatccaagag tactggaaag 13260 
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accgcgaaga 
tcggtaccaa 
agtgggttct 
accgcccatt 
ttggtgccaa 
gtcaaaccgc 
gatgactaat 
ataatgccag 
atacacttga 
atggaaagtc 
ggggtcgttg 
catatatggg 
tcaatgtcaa 
aagcagattc 
cgcgcactac 
ggcatcagag 
tctatggtgc 
ccgctatcgc 
gcgccctgac 
gggagctgca 



gtttgtcctc 
gcttgggtct 
ctagttagcc 
tgcgtcaatg 
aacaaactcc 
tatccacgcc 
acgtagatgt 
gcgggccatt 
tgtactgcca 
cctattggcg 
ggcggtcagc 
ctatgaacta 
cgcgtatatc 
ttcatgcaat 
tcagcgacct 
cagattgtac 
actctcagta 
tacgtgactg 
gggcttgtct 
tgtgtcagag 



aaccgcgagc 
ccctatagtg 
agagagctct 
gggcggagtt 
cattgacgtc 
cattgatgta 
actgccaagt 
taccgtcatt 
agtgggcagt 
ttactatggg 
caggcgggcc 
atgaccccgt 
tggcccgtac 
tgtcggtcaa 
ccaacacaca 
tgagagtcga 
caatctgctc 
ggtcatggct 
gctcccggca 
gttttcaccg 



ccaacagcga 
agtcgtatta 
gcttatatag 
gttacgacat 
aatggggtgg 
ctgccaaaac 
aggaaagtcc 
gacgtcaata 
ttaccgtaaa 
aacatacgtc 
atttaccgta 
aattgattac 
atcgcgaagc 
gccttgcctt 
agcagggagc 
ccatagggga 
tgatgccgca 
gcgccccgac 
tccgcttaca 
tcatcaccga 



gctcgaattc 
atttcgataa 
acctcccacc 
tttggaaagt 
agacttggaa 
cgcatcacca 
cataaggtca 

gggggcgtac 

tagtccaccc 
attattgacg 
agttatgtaa 
tattaataac 
agcgcaaaac 
gttgtagctt 
agatactggc 
tcgggagatc 
tagttaagcc 
acccgccaac 
gacaagctgt 
aacgcgcgag 



agatccgagc 
gccagtaagc 
gtacacgcct 
cccgttgatt 
atccccgtga 
tggtaatagc 
tgtactgggc 
ttggcatatg 
attgacgtca 
tcaatgggcg 
cgcggaactc 
tagtcaataa 
gcctaaccct 
aaattttgct 
ttaactatgc 
tcccgatccg 
agtatacact 
acccgctgac 
gaccgtctcc 
gcagc 



13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14455 



<210> 16 
<211> 10610 
<212> PNA 

<:213> Artificial Sequence 



<220> 



<223> Description of Artificial Sequence: piasmid 



<400> 16 

gacggatcgg 

tctatgtcat 

tcctcttaca 

cgtgtttatt 

accaccacat 

caacctgcca 

aaaaagcatc 

ctgtcgagcc 

catgtcgctg 

cggcgaagga 

gcggtggtgc 

atacaacatg 

tgtcctccgg 

cagcaccaca 

ggggaccaca 

cctcataaac 

ccggtaccat 

ggccaaaacc 

gagagcccag 

acacaggcac 

atcccaggga 

cacgtaactc 

cagtatggta 

gcgccgagac 

agtcatattt 

ctcgccgctt 

ggcgccccct 

ccaccaccgc 

cgggaggagc 

aaaacctcaa 

tctacagcca 



gagatccgcg 
tttcatggga 
ctttttcata 
tttcaattgc 
agcttataca 
cctccctccc 
atatcatggg 
aaacgctcat 
tccagctgct 
gaagtccacg 
tgcagcagcg 
gcagtggtct 
gcacagcagc 
atattgttca 
gaacccacgt 
acgctggaca 
ataaacctct 
tgcccgccgg 
gactcgtaac 
acgtgcatac 
acaacccatt 
acgttgtgca 
gcgcgggttt 
aaccgagatc 
cctgaagcaa 
agatcgctct 
ggcttcgggt 
agaataagcc 
gggaagagct 
aatgaagatc 
aagaacagat 



cggtacacag 
ctggtctggc 
cattgcccaa 
agaaaatttc 
gatcaccgta 
aacacacaga 
taacagacat 
cagtgatatt 
gagccacagg 
cctacatggg 
cgcgaataaa 
cctcagcgat 
gcaccctgat 
aaatcccaca 
ggccatcata 
taaacattac 
gattaaacat 
ctatacactg 
catggatcat 
acttcctcag 
cctgaatcag 
ttgtcaaagt 
ctgtctcaaa 
gtgttggtcg 
aaccaggtgc 
gtgtagtagt 
tctatgtaaa 
acacccagcc 
ggaagaacca 
tattaagtga 
aatggcattt 



aattcaggag 
cacaactaca 
gaataaagaa 
aagtcatttt 
ccttaatcaa 
gtacacagtc 
attcttaggt 
aataaactcc 
ctgctgtcca 
ggtagagtca 
ctgctgccgc 
gattcgcacc 
ctcacttaaa 
gtgcaaggcg 
ccacaagcgc 
ctcttttggc 
ggcgccatcc 
cagggaaccg 
catgctcgtc 
gattacaagc 
cgtaaatccc 
gttacattcg 
aggaggtaga 
tagtgtcatg 
gggcgtgaca 
tgtagtatat 
ctccttcatg 
aacctacaca 
tgtttttttt 
acgcgctccc 
gtaagatgtt 



acacaactcc 
ttaatgaaat 
tcgtttgtgt 
tcattcagta 
actcacagaa 
ctttctcccc 
gttatattcc 
ccgggcagct 
acttgcggtt 
taatcgtgca 
cgccgctccg 
gcccgcagca 
tcagcacagt 
ctgtatccaa 
aggtagatta 
atgttgtaat 
accaccatcc 
ggactggaac 
atgatatcaa 
tcctcccgcg 
acactgcagg 
ggcagcagcg 
cgatccctac 
ccaaatggaa 
aacagatctg 
ccactctctc 
cgccgctgcc 
ttcgttctgc 
tttattccaa 
ctccggtggc 
gcacaatggc 



aagtgcatac 
atttgccaca 
tatgtttcaa 
gtatagcccc 
ccctagtatt 
ggctggcctt 
acacggtttc 
cacttaagtt 
gcttaacggg 
tcaggatagg 
tcctgcagga 
taaggcgcct 
aactgcagca 
agctcatggc 
agtggcgacc 
tcaccacctc 
taaaccagct 
aatgacagtg 
tgttggcaca 
ttagaaccat 
gaagacctcg 
gatgatcctc 
tgtacggagt 
cgccggacgt 
cgtctccggt 
aaagcatcca 
ctgataacat 
gagtcacaca 
aagattatcc 
gtggtcaaac 
ttccaaaagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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caaacggccc tcacgtccaa 
ataaacattc cagcaccttc 
atatctctaa gcaaatcccg 
ccctccacct tcagcctcaa 
cctgtataag attcaaaagc 
gcagggccag ctgaacataa 
caggaacctt gacaaaagaa 
ccagcgtagc cccgatgtaa 
ctcaaaaaat caggcaaagc 
agataaaggc aggtaagctc 
atgtctgcgg gtttctgcat 
aagcctgtct tacaacagga 
cggcgtgacc gtaaaaaaac 
tcatgtccgg agtcataatg 
gctaaaaagc gaccgaaata 
acagccccca taggaggtat 
aaaccctcct gcctaggcaa 
cagcggcagc ctaacagtca 
ctcgacacgg caccagctca 
atatatagga ctaaaaa'atg 
cacgcgaacc tacgcccaga 
ttccgttttc ccacgttacg 
tcagtacaat ctgctctgat 
tggaggtcgc tgagtagtgc 
acaattgcat gaagaatctg 
ccagatatac gcgttgacat 
cattagttca tagcccatat 
ctggctgacc gcccaacgac 
taacgccaat agggactttc 
acttggcagt acatcaagtg 
gtaaatggcc cgcctggcat 
agtacatcta cgtattagtc 
atgggcgtgg atagcggttt 
atgggagttt gttttggcac 
ccccattgac gcaaatgggc 
tctggctaac tagagaaccc 
gggagaccca agcttggtac 
gttgaggaca aactcttcgc 
cgaacggtac tccgccaccg 
tctcgagaaa ggcgtctaac 
gcagcgggtg gcggtcgggg 
aggcggtctt gagacggcgg 
aagcgcgcaa gaccgtctga 
ggtcctccaa ctgtgccttt 
agtccccctg gggtactctc 
cttgcgctca aaatgggcaa 
aatgtaacca ctgtgagccc 
tctgcacccc tcacagttac 
gtcgcgggca acacactcac 
cttagcattg ccacccaagg 
tcaggccccc tcaccaccac 
actactgcca ctggtagctt 
aaactaggac taaagtacgg 
gtagcaactg gtccaggtgt 
gccttgggtt ttgattcaca 
gattctcaaa acagacgcct 
ctaaatctaa gactaggaca 
aactacaaca aaggccttta 
aacctaagca ctgccaaggg 
gatgggcttg aatttggttc 
ggccatggcc tagaatttga 
agttttgaca gcacaggtgc 
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gtggacgtaa aggctaaacc cttcagggtg ^atctcctct 1920 
aaccatgccc aaataattct catctcgcca ccttctcaat 1980 
aatattaagt ccggccattg taaaaatctg ctccagagcg 2040 
gcagcgaalc atgattgcaa aaattcaggt ^cctcacaga . 2100 
ggaacattaa caaaaatacc gcgatcccgt aggtcccttc 2160 
tcgtgcaggt ctgcacggac cagcgcggcc acttccccgc 2220 
cccacactga ttatgacacg catactcgga gctatgctaa 2280 
gctttgttgc atgggcggcg atataaaatg caaggtgctg 2340 
ctcgclcaaa aaagaaagca catcgtagtc atgctcatgc 2400 
cggLccacc acagaaaaag acaccatttt tctctcaaac 2460 
aaacacaaaa taaaataaca aaaaaacatt taaacattag 2520 
aaaacaaccc ttataagcat aagacggact acggccatgc 25B0 
tggtcaccgt gattaaaaag caccaccgac agctcctcgg 2640 
taagactcgg taaacacatc aggttgattc atcggtcagt .2700 
gcccggggga atacataccc gcaggcgtag agacaacatt 2760 
Lcaaaatta ataggagaga aaaacacata aacacctgaa 2820 
-aatagcaccc tcccgctcca gaacaacata cagcgcttca 2880 
gccttaccag taaaaaagaa aacctattaa aaaaacacca 2940 
atcagtcaca gtgtaaaaaa gggccaagtg cagagcgagt 3000 
acgtaacggt taaagtccac aaaaaacacc cagaaaaccg 3060 
aacgaaagcc aaaaaaccca caacttcctc aaatcgtcac 3120 
taacttcccg gatcctctcc cgatccccta feggtcgactc 3180 
gccgcatagt taagccagta tctgctccct gcttgtgtgt 3240 
gcgagcaaaa tttaagctac aacaaggcaa ggcttgaccg 3300 
cttagggtta ggcgttttgc gctgcttcgc gatgtacggg 3360 
tgattattga ctagttatta atagtaatca attacggggt 3420 
atggagttcc gcgttacata acttacggta aatggcccgc 3480 
ccccgcccat tgacgtcaat aatgacgtat gttcccatag 3540 
cattgacgtc aatgggtgga ctatttacgg taaactgccc 3600 
tatcatatgc caagtacgcc ccctattgac gtcaatgacg, 3660 
tatgcccagt acatgacctt atgggacttt cctacttggc 3720 
atcgctatta ccatggtgat gcggttttgg cagtacatca 3780 
gactcacggg gatttccaag tctccacccc attgacgtca 3840 
caaaatcaac gggactttcc aaaatgtcgt aacaactccg 3900 
ggtaggcgtg tacggtggga ggtctatata agcagagctc 3960 
actgcttLt ggcttatcga aattaatacg actcactata 4020 
cgagctcgga tctgaattcg agctcgctgt tgggctcgcg 4080 
gqtctttcca gtactcttgg atcggaaacc cgtcggcctc 414 0 
agggacctga gcgagtccgc atcgaccgga tcggaaaacc 4200 
cagtcacagt cgcaaggtag gctgagcacc gtggcgggcg 4260 
ttgtttctgg cggaggtgct gctgatgatg taattaaagt 4320 
atggtcgagg tgaggtgtgg caggcttgag atccaagatg 4380 
agataccttc aaccccgtgt atccatatga cacggaaacc 4 4 40 
tcttactcct ccctttgtat cccccaatgg gtttcaagag 4500 
tttgcgccta tccgaacctc tagttacctc caatggcatg 4560 
cggcctctct ctggacgagg ccggcaacct tacctcccaa 4620 
acctctcaaa aaaaccaagt caaacataaa cctggaaata 4680 
ctcagaagcc ctaactgtgg ctgccgccgc acctctaatg 4740 
catgcaatca caggccccgc taaccgtgca cgactccaaa 4800 
acccctcaca gtgtcagaag gaaagctagc cctgcaaaca 4860 
cgatagcagt acccttacta tcactgcctc accccctcta 4 920 
gggcattgac ttgaaagagc ccatttatac acaaaatgga 4980 
l?ctcct?tg catgtaacag acgacctaaa cactttgacc 5040 
gactattaat aatacttcct tgcaaactaa agttactgga 5100 
aggcaatatg caacttaatg tagcaggagg actaaggatt 5160 
taSacttgat gttagttatc cgtttgatgc tcaaaaccaa 5220 
gggccctctt tttataaact cagcccacaa cttggatatt 5280 
cttgtttaca gcttcaaaca attccaaaaa gcttgaggtt 5340 
gttgatgttt gacgctacag ccatagccat taatgcagga 5400 
LcLatgca ccaaacacaa atcccctcaa aacaaaaatt 54 60 
ttcaaacaag gctatggttc ctaaactagg ^^^^^t 5520 

cattacagta ggaaacaaaa ataatgataa gctaactttg 5580 
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tggaccacac cagctccatc tcctaactgt agactaaatg -gagaaaga tgctaaactc 5640 
ac?ttggtct taacaaaatg tggcagtcaa ^^aottgota ^^gt 9^ tacaagattt 5760 
aaaggcagtt tggctccaat atctggaaca g"caaagtg ^ gaactttaga 5820 

gacgaaaatg gagtgctact aaacaattcc "^^^ggacc ^9 gcctaaccta 5880 

latggagatc ttactgaagg cacagoctat ^^^^^^fctg J^gg ^caagtttac 5940 

tcallttatc caaaatctca =ggtaaaact g=«^^^9ta ^ tacacaggaa 6000 

ttaaacggag acaaaactaa acctgtaaca ^taaccatta || gtctggocac 6060 

acaggagaca caactocaag tgcatactct ^^gtcatttt ^ 9^9 y tgcccaagaa 6120 
aactacatta atgaaatatt tgccacatcc ^cttacactt ttt t^a^ctaaat 6180 

taaagaagcg gccgctcgag 9«^tcta g^gggcccta 9^9 tgttgtttgc 6240 

gctagagctc gctgatcagc ^^^'^^^^f'^ ^^"ccactc ccaclgtcct ttcctaataa 6300 
ccctcccccg tgccttcctt gaccctggaa ^^tgccactc ^ ^ gggtggggtg 6360 
aatgaggaaa .ttgcatcgca ttgtctgagt agS^gtcatt ^tattctggg ^^l^^^^^ ^^^0 
gggcalgaca gcaaggggga ggattgggaa ^^^^^agca S^^^^^ccgg ^^^l^^ g^go 
ggltc?atgg cttctgaggc ggaaagaacc ^^^^ggggct ^taggggg ^^^^^ ^540 

cLtgtagcg gcgcattaag cgoggcgggt ^tggtggtta ^^=9 Igccacgttc 6600 

cttgccagcg ccctagcgcc ^gotcctttc g^^ttcttoc « atttagtgct 6660 

gccggctttc cccgtcaagc tctaaatcgg 99""^=" gtilacgtag tgggccatcg 6720 
Itacggcacc tcgacccoaa a^^^^ttgat "gggtgatg 9^ tagtggactc 6780 

ccctgataga cggtttttcg "9^^9 aScggtct attcttttga tttataaggg 684 0 

ttgttccaaa ctggaacaac --^caaccct ^tctcggtct ^^^^^^^^ .tttaacgcg 6900 
attttqqgga tttcggccta ttggttaaaa aatgagi-i-y .^_„aac,c tccccaggca 6960 
lllLllllt gtggaatgtg tgtcagttag g^tgtggaaa gtccccaggc tec gg^^ ^^^^ 
ggcagaagta tgcaaagcat gcatctcaat tagtcagcaa ^cagg g 99 .^qSO 
g5ctccccag caggcagaag tatgoaaagc ^^gcatctca ^^tag g^ ttctccgccc 7140 
Icgcccctaa ctccgcccat cccgccccta =^^^^9==" ccqcctctgc ctctgagcta 7200 
ca?ggctgac taattttttt tatttatgca 9aggccgagg -gcct^J^ gctcccggga 7260 
ttccagaagt agtgaggagg ^^tttttgga gg^^^aggct 9 ^cgcatgatt 7 320 

gcttgtatat ccattttcgg ^tctgatcaa 9agacaggat 9 9g ^^^^ attcggctat 7380 
gaacaagatg gattgcacgc ^ggttctccg 9ccgcttggg 99^^^^^ gtcagcgcag 744 0 
gactgggcac aacagacaat ^ggctgctct g^tgccgccg ^g 9g 7500 
gggcgcccgg ttctttttgt caagaccgac '^tgtccggtg 9 y tgtgctcgac 7560 

giggcagcgc ggctatcgtg 9Ptggccacg --9ggcgttc -^tgcgcag 9^9 ^^^^^ ^^^0 
gttgtcactg aagcgggaag ggactggctg ^^attgggcg ^^9 9 99^ aatgcggcgg 7 680 
ctgLatctc accttgctcc tgccgagaaa 9tatccatca tggctg 9^ tcgcatcgag 7740 
ctgcatacgc ttgatccggc tacctgccca ttcgaccacc |^9 9 - ^ ^ ^g^Q 

cgagoacgta ctcggatgga agccggtctt g^cgatcagg ^^9a 99 ^ 786O 
ciggggctcg cgccagccga actgttcgcc ^ggctcaagg ^9cgcatg 9^^9.9^^ ^^^o 
gaicicgtcg tgacccatgg cgatgcctgc ttgccgaata Jcatgg gg ^ ^ggo 

?tttctggat tcatcgactg tggccggctg g9^9tggcgg ---9ctat 9g^^^^^^^^ ^^^^ 
ttggctaccc gtgatattgc tgaagagctt g9cg9ogaat 99g 9 9 ^^^^^^^^^g giOO 
ctttacggta tcgccgctcc cgattcgcag ^gcatcgcct ^9 aacctgccat 8160 
ttcttctgag cgggactctg gggttcgaaa tgaccgacca ^9 9 9 ^^^^^^ g220 

cacgagattt cgattccacc gccgccttct atgaaaggtc 999 ttcgcccacc 8280 

gggLgccgg ctggatgatc f c-9-9cg 9ggatctcat g^tggagttc ^^^^^^^^^ 33,^. 
ccaa?:ttgtt tattgcagct tataatggtt ^^^^^^^^^9 J^^^^ atcaatgtat 8400 

caaataaagc atttttttca ctgcattcta 9ttgtggttt 9^c ^ggtcatagc 8460 

cttatcatgt ctgtataccg tcgacctcta gctagagctt 9gcgtaacc 99 ^ ^^^o 
tgtttcctgt gtgaaattgt tatccgctoa "attccaca --|--J^^9a 9ccgg g^^ ^^^^ 
taaagtgtaa agcctggggt gcctaatgag tgagctaact ^acatc 9^ ^^^^c 8640 
cactgcccgc tttccagtcg 99-aacctgt cgtgccagct 9«ttaatg^ actgLtcgc 8700 
gcgcggggag aggcggtttg cgtattgggc 9^tcttocgc Jtc 9 ^^gg^ 8760 

tgcgctcggt cgttcggctg «9gcgagcgg ^atcagctca 9 ^^^^^gg ggzo 

tatccacaga atcaggggat aacgcaggaa ^gaacatgtg 9 |9^ ccccctgacg 8880 
ccaggaaccg taaaaaggcc gcgttgctgg ^9tttttcca ^99 9 ctataaagat 894 0. 
agcatcacaa aaatcgacgc tcaagtcaga 99^99=9^33 ^^^9 9g ^^^^ ^^^^ 

accaggcgtt tccccctgga agctccctcg t9cgotctcc tgtcc g ^ ^ 9060 

ccggltacct gtccgccttt f =--"^9g 9aagcgtggc 9cttt ,,,, 9120 

gtaggtatct cagttcggtg t^g9tcgttc 9Ctccaagct 999 9 aacccggtaa 9180 

Icgttcagcc cgaccgctgc 9==ttatccg 9taactatcg g 9^ gcgaggtatg 9240 
irgg:?gfgc -fgrgftl ^tgrag^ggt T.lllir.ll IggctLact agaaggacag 9300 
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tatttggtat 
gatccggcaa 
cgcgcagaaa 
agtggaacga 
cctagatcct 
cttggtctga 
ttcgttcatc 
taccatctgg 
tatcagcaat 
ccgcctccat 
atagtttgcg 
gtatggcttc 
tgtgcaaaaa 
cagtgttatc 
taagatgctt 
ggcgaccgag 
ctttaaaagt 
cgctgttgag 
ttactttcac 
gaataagggc 
gcatttatca 
aacaaatagg 



ctgcgctctg 
acaaaccacc 
aaaaggatct 
aaactcacgt 
tttaaattaa 
cagttaccaa 
catagttgcc 
ccccagtgct 
aaaccagcca 
ccagtctatt 
caacgttgtt 
attcagctcc 
agcggttagc 
actcatggtt 
ttctgtgact 
ttgctcttgc 
gctcatcatt 
atccagttcg 
cagcgtttct 
gacacggaaa 
gggttattgt 
ggttccgcgc 



ctgaagccag 
gctggtagcg 
caagaagatc 
taagggattt 
aaatgaagtt 
tgcttaatca 
tgactccccg 
gcaatgatac 
gccggaaggg 
aattgttgcc 
gccattgcta 
ggttcccaac 
tccttcggtc 
atggcagcac 
ggtgagtact 
ccggcgtcaa 
ggaaaacgtt 
atgtaaccca 
gggtgagcaa 
tgttgaatac 
ctcatgagcg 
acatttcccc 



-20- 

ttaccttcgg 
gtggtttttt 
ctttgatctt 
tggtcatgag 
ttaaatcaat 
gtgaggcacc 
tcgtgtagat 
cgcgagaccc 
ccgagcgcag 
gggaagctag 
caggcatcgt 
gatcaaggcg 
ctccgatcgt 
tgcataattc 
caaccaagtc 
tacgggataa 
cttcggggcg 
ctcgtgcacc 
aaacaggaag 
tcatactctt 
gatacatatt 
gaaaagtgcc 



aaaaagagtt 
tgtttgcaag 
ttctacgggg 
attatcaaaa 
ctaaagtata 
tatctcagcg 
aactacgata 
acgctcaccg 
aagtggtcct 
agtaagtagt 
ggtgtcacgc 
agttacatga 
tgtcagaagt 
tcttactgtc 
attctgagaa 
taccgcgcca 
aaaactctca 
caactgatct 
gcaaaatgcc 
cctttttcaa 
tgaatgtatt 
acctgacgtc 



ggtagctctt 
cagcagatta 
tctgacgctc 
aggatcttca 
tatgagtaaa 
atctgtctat 
cgggagggct 
gctccagatt 
gcaactttat 
tcgccagtta 
tcgtcgtttg 
tcccccatgt 
aagttggccg 
atgccatccg 
tagtgtatgc 
catagcagaa 
aggatcttac 
tcagcatctt 
gcaaaaaagg 
tattattgaa 
tagaaaaata 



9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10610 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> ^ . 

<223> Description of Artificial Sequence: Primer 

<400> 17 24 
tgtacaccgg atccggcgca cacc 



<210> 18 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

<220> „ . 

<223> Description of Artificial Sequence: Prxmer 

<400> 18 35 
cacaacgagc tcaattaatt aattgccaca tcctc 



<210> 19 

<211> 4 

<212> PRT 

<213> adenovirus 

<400> 19 
Thr Leu Trp Thr 
1 



<210> 20 

<211> 12 

<212> PRT 

<213> adenovirus 
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<400> 20 

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly Ser 
15 10 



<210> 21 

<211> 44 

<212> PRT 

<213> adenovirus 




Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro 
35 40 



<210> 22 

<211> 43 

<212> PRT 

<213> adenovirus 

<400> 22 

Met Ala Lys Arg Ala Arg Leu Ser 
1 5 

Tyr Glu Asp Glu Ser Ser Ser Gin 
20 

lie Ser Pro Asp Gly Phe Thr Gin 

35 40 



Thr Ser Phe Asn Pro Val Tyr Pro 
10 15 

His Pro Phe lie Asn Pro Gly Phe 
25 30 

Ser Pro Asn 



<210> 23 

<211> 43 

<212> PRT 

<213> adenovirus 




Val Ser Ser Asp Gly Phe Lys Asn Phe Pro Pro 
35 40 



<210> 24 

<211> 42 

<212> PRT 

<213> adenovirus 

Set^Lys^Arg Ala Arg Phe Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 
1 5 10 1^ 
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Glu His Tyr Asn Pro Leu Asp lie Pro Phe lie Thr Pro Pro Phe Ala 
20 25 

Ser Ser Asn Gly Leu Gin Glu Lys Pro Pro 
35 40 



<210> 25 

<2X1> 42 

<212> PRT 

<213> adenovirus 



<400> 25 ^ ^ 

Met Lys Arg Thr Arg He Glu Asp 

1 ' 5 

Asp Thr Ser Ser Thr Pro Ser He 
20 

Ser Ser Asp Gly Leu Gin Glu Asn 
35 40 



Asp Phe Asn Pro Val Tyr Pro Tyr 
10 15 

Pro Tyr Val Ala Pro Pro Phe Val 
25 30 

Pro Pro 



<210> 26 
<211> 327 
<212> DNA 
<213> adenovirus 



<400> 26 

agatctgaat tcgagctcgc 
ccagtactct tggatcggaa 
tgagcgagtc cgcatcgacc 
agtcgcaagg taggctgagc 
tggcggaggt gctgctgatg 
aggtgaggtg tggcaggctt 



tgttgggctc gcggttgagg 
acccgtcggc ctccgaacgg 
ggatcggaaa acctctcgag 
accgtggcgg gcggcagcgg 
atgtaattaa agtaggcggt 
gagatct 



acaaactctt cgcggtcttt 60 
tactccgcca ccgagggacc 120 
aaaggcgtct aaccagtcac 180 
gtggcggtcg gggttgtttc 240 
cttgagacgg cggatggtcg 300 



<210> 27 
<211> 32480 
<212> DNA 
<213> adenovirus 



iS i s 

SrJi SS i™ i;s 

~i ii i is ^ - " Hi 
sii S =s ^ ^ s= 

-^^i^. SHI iii i s i~ 5- 
iiSS Hi =s cis ^ i 

aSSS ,L,cclc„ cotct|.,« at.cc.,.., t.,t,.,,., jcttttttg |00 

flsltiisissHSiifssis 
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gaccagaaac 

ggcgagatcg 

cttaatcgcc 

accgatcgcc 

ccggcaccag 

gtcgtcgtcc 

acctatccca 

tcgctcacat. 

gatggcgtta 

gacagtcgtt 

ctcgcggtga 

cggatgagcg 

gatttccatg 

gttcagatgt 

gaaacgcagg 

ggttatgccg 

gaaatcccga 

gaagcagaag 

ctgaacggca 

ggtcaggtca 

tttaacgccg 

cgctacggcc 

aatcgtctga 

gtgcagcgcg 

cacggcgcta 

gtgcagtatg 

gcgcgcgtgg 

ctttcgctac 

aacagtcttg 

ggcggcttcg 

ccgtggtcgg 

aacggtctgg 

cagcagtttt 

ttccgtcata 

gcaagcggtg 

gaactaccgc 

aacgcgaccg 

gaaaacctca 

gaaatggatt 

tttctttcac 

ttcacccgtg 

aacgcctggg 

cagtgcacgg 

catcagggga 

atggcgatta 

ctgaactgcc 

gaaaactatc 

gacatgtata 

ttgaattatg 

caacagcaac 

ctgaatatcg 

tcggcggaat 

taataataac 

tatttaaaaa 

atgggagcct 

agtgatacgg 

tttggtctgc 

aaagcaatag 

gtttgtccaa 

agggtgggaa 

gccgccgccg 

acgcgcatgc 



agcacctcga 

atcccgtcgt 

ttgcagcaca 

cttcccaaca 

aagcggtgcc 

cctcaaactg 

ttacggtcaa 

ttaatgttga 

actcggcgtt 

tgccgtctga 

tggtgctgcg 

gcattttccg 

ttgccactcg 

gcggcgagtt 

tcgccagcgg 

atcgcgtcac 

atctctatcg 

cctgcgatgt 

agccgttgct 

tggatgagca 

tgcgctgttc 

tgtatgtggt 

ccgatgatcc 

atcgtaatca 

atcacgacgc 

aaggcggcgg 

atgaagacca 

ctggagagac 

gcggtttcgc 

tctgggactg 

cttacggcgg 

tctttgccga 

tccagttccg 

gcgataacga 

aagtgcctct 

agccggagag 

catggtcaga 

gtgtgacgct 

tttgcatcga 

agatgtggat 

caccgctgga 

tcgaacgctg 

cagatacact 

aaaccttatt 

ccgttgatgt 

agctggcgca 

ccgaccgcct 

ccccgtacgt 

gcccacacca 

tgatggaaac 

acggtttcca 

tccagctgag 

cgggcaggcc 

acacaaactt 

acttcccgtt 

gtattatttt 

tttctgacaa 

catcacaaat 

actcatcaat 

agaatatata 

ccatgagcac 

ccccatgggc 



actgagccgc 

tttacaacgt 

tccccctttc 

gttgcgcagc 

ggaaagctgg 

gcagatgcac 

tccgccgttt 

tgaaagctgg 

tcatctgtgg 

atttgacctg 

ttggagtgac 

tgacgtctcg 

ctttaatgat 

gcgtgactac 

caccgcgcct 

actacgtctg 

tgcggtggtt 

cggtttccgc 

gattcgaggc 

gacgatggtg 

gcattatccg 

ggatgaagcc 

gcgctggcta 

cccgagtgtg 

gctgtatcgc 

agccgacacc 

gcccttcccg 

gcgcccgctg 

taaatactgg 

ggtggatcag 

tgattttggc 

ccgcacgccg 

•tttatccggg 

gctcctgcac 

ggatgtcgct 

cgccgggcaa 

agccgggcac 

ccccgccgcg 

gctgggtaat 

tggcgataaa 

taacgacatt 

gaaggcggcg 

tgctgatgcg 

tatcagccgg 

tgaagtggcg 

ggtagcagag 

tactgccgcc 

cttcccgagc 

gtggcgcggc 

cagccatcgc 

tatggggatt 

cgccggtcgc 

atgtctgccc 

ttggatgttc 

tttcccgatt 

tgccgctatt 

actcggaact 

ttcacaaata 

gtatcttatc 

aggtgggggt 

caactcgttt 

cggggtgcgt 



gatattgccc 

cgtgactggg 

gccagctggc 

ctgaatggcg 

ctggagtgcg 

ggttacgatg 

gttcccacgg 

ctacaggaag 

tgcaacgggc 

agcgcatttt 

ggcagttatc 

ttgctgcata 

gatttcagcc 

ctacgggtaa 

ttcggcggtig 

aacgtcgaaa 

gaactgcaca 

gaggtgcgga 

gttaaccgtc 

caggatatcc 

aaccatccgc 

aatattgaaa 

ccggcgatga 

atcatctggt 

tggatcaaat 

acggccaccg 

gctgtgccga 

atcctttgcg 

caggcgtttc 

tcgctgatta 

gatacgccga 

catccagcgc 

caaaccatcg 

tggatggtgg 

ccacaaggta 

ctctggctca 

atcagcgcct 

tcccacgcca 

aagcgttggc 

aaacaactgc 

ggcgtaagtg 

ggccattacc 

gtgctgatta 

aaaacctacc 

agcgatacac 

cgggtaaact 

tgttttgacc 

gaaaacggtc 

gacttccagt 

catctgctgc 

ggtggcgacg 

taccattacc 

gtatttcgcg 

ggtttattct 

tggctacatg 

tctctgttct 

tgtttattgc 

aagcattttt 

atgtctggat 

cttatgtagt 

gatggaagca 

cagaatgtga 



agcgtttcaa 

aaaaccctgg 

gtaatagcga 

aatggcgctt 

atcttcctga 

cgcccatcta 

agaatccgac 

gccagacgcg 

gctgggtcgg 

tacgcgccgg 

tggaagatca 

aaccgactac 

gcgctgtact 

cagtttcttt 

aaattatcga 

acccgaaact 

ccgccgacgg 

ttgaaaatgg 

acgagcatca 

tgctgatgaa 

tgtggtacac 

cccacggcat 

gcgaacgcgt 

cgctggggaa 

ctgtcgatcc 

atattatttg 

aatggtccat 

aatacgccca 

gtcagtatcc 

aatatgatga 

acgatcgcca 

tgacggaagc 

aagtgaccag 

cgctggatgg 

aacagttgat 

cagtacgcgt 

ggcagcagtg 

tcccgcatct 

aatttaaccg 

tgacgccgct 

aagcgacccg 

aggccgaagc 

cgaccgctca 

ggattgatgg 

cgcatccggc 

ggctcggatt 

gctgggatct 

tgcgctgcgg 

tcaacatcag 

acgcggaaga 

actcctggag 

agttggtctg 

taaggaaatc 

ttttctttta 

acatcaacca 

cgctattatt 

agcttataat 

ttcactgcat 

ccagatctgg 

tttgtatctg 

ttgtgagctc 

tgggctccag 



cgcgctgtat 

cgttacccaa 

agaggcccgc 

tgcctggttt 

ggccgatact 

caccaacgta 

gggttgttac 

aattattttt 

ttacggccag 

agaaaaccgc 

ggatatgtgg 

acaaatcagc 

ggaggctgaa 

atggcagggt 

tgagcgtggt 

gtggagcgcc 

cacgctgatt 

tctgctgctg 

tcctctgcat 

gcagaacaac 

gctgtgcgac 

^gtgccaatg 

aacgcgaatg 

tgaatcaggc 

ttcccgcccg 

cccgatgtac 

caaaaaatgg 

cgcgatgggt 

ccgtttacag 

aaacggcaac 

gttctgtatg 

aaaacaccag 

cgaatacctg 

taagccgctg 

tgaactgcct 

agtgcaaccg 

gcgtctggcg 

gaccaccagc 

ccagtcaggc 

gcgcgatcag 

cattgaccct 

agcgttgttg 

cgcgtggcag 

tagtggtcaa 

gcggattggc 

agggccgcaa 

gccattgtca 

gacgcgcgaa 

ccgctacagt 

aggcacatgg 

cccgtcagta 

gtgtcaaaaa 

cattatgtac 

cttttttatc 

tatcagcaaa 

ccaaccgctg 

ggttacaaat 

tctagttgtg 

gcgtggctta 

ttttgcagca 

atatttgaca 

cattgatggt 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560' 

4620 

4680 

4740 

4800 

4860 
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cgccccgtcc tgcccgcaaa ctctactacc ttgacctacg ^g-cgtgtc tggaacgccg 4920 
t?ggaglctg cagcctccgc cgccgctt.ca |ccgctgcag ccaccgcccg ^^^^ 
actgactttg ctttcctgag «^9cttgca acttaatgtc. 5100 . 

gatgacaagt tgacggctct tttggcacaa ttggattctt ttcctcccct 5160 

gtttctcagc agctgttgga tctgcgccag ^^99"^=^^ gatcaagcaa 5220 

cccaatgcgg tttaaaacat aaataaaaaa ccagactctg ^"^^ ^ ccagcggtct 5280 
gtgtct?g^? gtctttattt aggggttttg cgcgcgoggt aggcccggga c^^g^gg^^^ ^^^^ 
cggtcgttga gggtcctgtg tattttttcc f^g^^^^^g ==tgcag agcttcatgc 54 00 

agatalatgg gcataagccc gtctctgggg ^gg^ggtagc ^^^^J^^^^^ ' ^^^^^^^ 54 go 
ticggggtgg tgttgtagat gatccagtcg tagcaggagc ^^^ggg g g ^^^^^^^^ 5520 
atgtctttca gtagcaagct gattgcoagg ^^"^gccct 5 ^ ctgtattttt 5SB0 
cggttaagct gggatgggtg catacgtggg 9^tatgagat ^ ^^^caccagc 5640 

ag^tggota tgttcccagc catatccctc =99^9^"^^ tlgalggaal tgcgtggaag 5700 
acagtgtatc cggtgcactt g?^^^^"^^ ^catgtagct tag gg^^^ aatgatggca 5760 
aacttggaga cgcccttgtg acctccaaga ttttccatg gtcatagttg 5820 

atgggcccac gggcggcggc ctgggcgaag ^^attcccgg 9 ggtgccagac 5880 

tgttccagga tgagatcgtc ataggccatt "tacaaagc ^^^^g 95 9 ^^gcatttcc 5940 
tgcggtataa tggttccatc cggcccaggg g-^tagttac J^^ca g^^ gjaacggtt 6000 
cacgctttga gttcagatgg gg^^^tcatg tctacctgcg 99^^^ cgacttaccg 6060 
tccggggtag gggagatcag ^tgggaagaa ^gcaggttcc tg g g 9 ^^^^g 6120 

cagccggtgg gcccgtaaat «<^^==tatt ^ccgggcg gactcgcatg 6180 

cagctgccgt catccctgag "999995^^ tcaccgccca gcgaLgcag ttcttgcaag 624 0 
ttttccctga ccaaatccgc cagaaggcgc tcgccgccca 9 9 ^ J gagcgtttga 6300 
gaagcaaagt ttttcaacgg tttgagaccg tccgccgtag 9=^tgctt 9 9^9^^^^^ ^^^^ 
ccaagcagtt ccaggcggtc ccacagctcg 9tcacctgct ^^acgg ^cgt 6420 

atatctcctc gtttcgcggg ttggggcggc "tcgctgta ^9gcag g ^ ^ g^go 
ccagacgggc cagggtcatg tctttccacg 9gcgcagggt JJ^cgtcag 9 9 ggg ^^^^ 
tcacggtgaa ggggtgcgct ccgggctgcg ^9ctggccag 9g g 9 ^ catttgacca 6600 
tgctggtgct gaagcgctgc cggtcttcgc ^^^gcgcgtc 9g 9g^^| cccttggagg 6660 
tggtgtcata gtccagcccc tccgcggcgt g9cccttggc 9=9 gcgagaaata 6720 

allcgccgca cgaggggcag tgcagacttt tgagggcgta 9agcttggg 9^9^9^^ ^^g^. 
ccgattccgg ggagtaggca tccgcgccgc ^ggccccgca 9 9g^^ 9 ^^^^^ 5340 
gccaggtgag ctctggccgt tcggggtcaa ^^^^^^9gt aggctgtccg 6900 

gtttcttacc tctggtttcc atgagccggt 9tccacgctc 9g^9 | tcctcctcgt 6960 
?gtccccgta tacagacttg agaggcctgt cctcgagcgg ^gttccg gg ^^^O 
atagaaactc ggaccactct 9agacaaagg ^^cgcgtcca 9gccag g ^ 9^ ^^^^ 
agtgggaggg gtagcggtcg "gtccacta 9ggggtccac ^cgc gg l ^^.q 
acatgtcgcc ctcttoggca tcaaggaagg tgattggttt 9^ 99 9 ^ tcactctctt 7200 
cgggtgttcc tgaagggggg ctataaaagg 999^99999= 9cgttcg^ ^250 
ccgcatcgct gtctgcgagg gccagctgtt g9ggt9agta ^^ccctc g 9 ^320 

tglcttctgc gctaagattg tcagtttcca -^--9agga gg^^ttga^^ atctttt?gt 7380 
ccgcggtgat gcctttgagg gtggccgcat ccatccggtc 9 ^ gcgatggagc 74 4 0 
tgLIIgctt ggtggcaaac 9acccgtaga g99cgttgga -gcaact^g 9 9 99^9^ ^^^^ 
gcagggtttg gtttttgtcg ^9-tcggcgc g^^ccttggc 9 9 |^ ggcaccaggt 7560 
attcgcgcgc aacgcaccgc cattcgggaa ^9^^99t99^ Ltggtggct acctctccgc 7620 
gcaogcgcca accgcggttg tgcagggtga ^aaggtcaac 9^ 99^9g^^ ggcggtaggg 7680 
gtaggcgctc gttggtccag «g^ggf9g^ ^9cccttg g 9 9 9^^ ggcagcaggc 7740 
ggtctagctg cgtctcgtcc ggggggtctg <=9tccacgg^ cacctgctgc catgcgcggg 7800 
gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag ^9=«9Ctgc 9 ^^^^ 
cggclagcgc gcgctcgtat gggttgagtg 9999^^^^" ^?Sgagt at?ccaaga? 7920 
cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg ^^ctccg g ^ggo 
a?gtlgg?ta gcatcttcca ccgcggatgc tggcgcgcac gaatcgtat ^9 9^9J ^^^^ 
agggagcgag gaggtcggga ccgaggttgc ^acgggcggg ^ ^ acgttgaagc 8100 
tl£gcctgaa gatggcatgt gagttggatg ^tatggttgg ^cgctgga 9 9^ 9^ g.^^ 
tggcgtctgt gagacctacc gcgtcacgca «=gaaggaggc g^^gg g 9 ^^^^tgatga 8220 
tgaccagctc ggcggtgacc tgcacgtcta 99gcgcagta gtccaggg^^ ^,ctc?tcgc 8280 
tgtcatactt atcctgtccc ttttttttcc ^cagcccgcg 9 gagcctagca 8340 

ggtctttcca gtactcttgg atcggaaacc cgtcggcotc ^g^acggra 9 9 « ^^^^ 
?gtagaactg gttgacggcc tggtaggcgc agcatccctt ttctacggg JJ^9^^^9 ^^^^ 
cctgcgcggc cttccggagc gaggtgtggg ^gagcgcaaa gg 9 agcaaaaagt 8520 

fttrgaS^ gg-t?ggfa gSga^ggt gaLtcgttg aagag.atc. 8580 
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ttcccgcgcg aggcataaag ttgcgtgtga 
tgttaattac ctgggcggcg agcacgatct 
tgtaaagttc caagaagcgc gggatgccct 
aggtgagctc ttcaggggag ctgagcccgt 
ggttggaagc gacgaatgag ctccacaggt 
gaaaggtcct aaactggcga cctatggcca 
gcgggtcttg ttcccagcgg tcccatccaa 
ctagaggctc atctccgccg aacttcatga 
aggcccccat ccaagtatag gtctctacat 
gatgcgagcc gatcgggaag aactggatct 
tgtggtgaaa gtagaagtcc ctgcgacggg 
gtgcgcagta ctggcagcgg tgcacgggct 
cgcgcacaag gaagcagagt gggaatttga 
cttctacttc ggctgcttgt ccttgaccgt 
ggaccaccac gccgcgcgag cccaaagtcc 
tgacaacatc gcgcagatgg gagctgtcca 
gcgggagctc ctgcaggttt acctcgcata 
gatacctaat ttccaggggc tggttggtgg 
cccgcggcgc gactacggta ccgcgcggcg 
atgcatctaa aagcggtgac gcgggcgagc 
cgggagaggg ggcaggggca cgtcggcgcc 
taggttgctg gcgaacgcga cgacgcggcg 
gaagacgacg ggcccggtga gcttgagcct 
gtcgttgacg gcggcctggc gcaaaatctc 
gatctcggcc atgaactgct cgatctcttc 
cacggtggcg gcgaggtcgt tggaaatgcg 
tccctcgttc cagacgcggc tgtagaccac 
cacctgcgcg agattgagct ccacgtgccg 
aaagaggtag ttgagggtgg tggcggtgtg 
tcgcaacgtg gattcgttga tatcccccaa 
gtccacggcg aagttgaaaa actgggagtt 
aagacggatg agctcggcga cagtgtcgcg 
ttcttcttct tcaatctcct cttccataag 
tgggggaggg gggacacggc ggcgacgacg 
gatcatctcc ccgcggcgac ggcgcatggt 
gcgcagttgg aagacgccgc ccgtcatgtc 
cggcagggat acggcgctaa cgatgcatct 
gagggacctg agcgagtccg catcgaccgg 
ccagtcacag tcgcaaggta ggctgagcac 
gttgtttctg gcggaggtgc tgctgatgat 
gatggtcgac agaagcacca tgtccttggg 
catgccccag gcttcgtttt gacatcggcg 
ttctaccggc acttcttctt ctccttcctc 
ggcggcggcg gagtttggcc gtaggtggcg 
gcccctcatc ggctgaagca gggctaggtc 
ctgcacctgc gtgagggtag actggaagtc 
cgtgttgatg gtgtaagtgc agttggccat 
ctgcgagagc tcggtgtacc tgagacgcga 
gcaagtccgc accaggtact ggtatcccac 
gggccagcgt agggtggccg gggctccggg 
tccgtagatg tacctggaca tccaggtgat 
gtcgcggacg cggttccaga tgttgcgcag 
ctggccggtc aggcgcgcgc aatcgttgac 
agcgggcact cttccgtggt ctggtggata 
gggttcgagc cccgtatccg gccgtccgcc 
aacccaggtg tgcgacgtca gacaacgggg 
ggcggctgct gcgctagctt ttttggccac 
gaaagcgaaa gcattaagtg gctcgctccc 
agtcgcggga cccccggttc gagtctcgga 
ctccccgtca tgcaagaccc cgcttgcaaa 
ttgcttttcc cagatgcatc cggtgctgcg 
agagcaagag cagcggcaga catgcagggc 
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tgcggaaggg tcccggcacc tcggaacggt 8 640 
cgtcaaagcc gttgatgttg tggcccacaa 8700 
tgatggaagg caatttttta agttcctcgt 87 60 
gctctgaaag ggcccagtct gcaagatgag 8820 
cacgggccat tagcatttgc aggtggtcgc 8880 
ttttttctgg ggtgatgcag tagaaggtaa 8940 
ggttcgcggc taggtctcgc gcggcagtca 9000 
ccagcatgaa gggcacgagc tgcttcccaa 9060 
cgtaggtgac aaagagacgc tcggtgcgag 9120 
cccgccacca attggaggag tggctattga 9180 
ccgaacactc gtgctggctt ttgtaaaaac 924 0 
gtacatcctg cacgaggttg acctgacgac 9300 
gcccctcgcc tggcgggttt ggctggtggt 9360 
ctggctgctc gaggggagtt acggtggatc 9420 
agatgtccgc gcgcggcggt cggagcttga 9480 
tggtctggag ctcccgcggc gtcaggtcag 9540 
gacgggtcag ggcgcgggct agatccaggt 9600 
cggcgtcgat ggcttgcaag aggccgcatc 9660 
ggcggtgggc cgcgggggtg tccttggatg 9720 
ccccggaggt agggggggct ccggacccgc 9780 
gcgcgcgggc aggagctggt gctgcgcgcg 9840 
gttgatctcc tgaatctggc ^cctctgcgt 9900 
gaaagagagt tcgacagaat caatttcggt 9960 
ctgcacgtct cctgagttgt cttgataggc 10020 
ctcctggaga tctccgcgtc cggctcgctc 10080 
ggccatgagc tgcgagaagg cgttgaggcc 10140 
gcccccttcg gcatcgcggg cgcgcatgac 10200 
ggcgaagacg gcgtagtttc gcaggcgctg 10260 
ttctgccacg aagaagtaca taacccagcg 10320 
ggcctcaagg cgctccatgg cctcgtagaa 10380 
gcgcgccgac acggttaact cctcctccag 104 40 
cacctcgcgc tcaaaggcta caggggcctc 10500 
ggcctcccct tcttcttctt ctggcggcgg 10560 
gcgcaccggg aggcggtcga caaagcgctc 10 620 
ctcggtgacg gcgcggccgt tctcgcgggg 10 680 
ccggttatgg gttggcgggg ggctgccatg 1074 0 
caacaattgt tgtgtaggta ctccgccgcc 10800 
atcggaaaac ctctcgagaa aggcgtctaa 10860 
cgtggcgggc ggcagcgggc ggcggtcggg 10920 
gtaattaaag taggcggtct tgagacggcg 10980 
tccggcctgc tgaatgcgca ggcggtcggc 1104 0 
caggtctttg tagtagtctt gcatgagcct 11100 
ttgtcctgca tctcttgcat ctatcgctgc 11160 
ccctcttcct cccatgcgtg tgaccccgaa 11220 
ggcgacaacg cgctcggcta atatggcctg 11280 
atccatgtcc acaaagcggt ggtatgcgcc 1134 0 
aacggaccag ttaacggtct ggtgacccgg 11400 
gtaagccctc gagtcaaata cgtagtcgtt 11460 
caaaaagtgc ggcggcggct ggcggtagag 11520 
ggcgagatct tccaacataa ggcgatgata 11580 
gccggcggcg gtggtggagg cgcgcggaaa 11640 
cggcaaaaag tgctccatgg tcgggacgct 11700 
gctctagacc gtgcaaaagg agagcctgta 11760 
aattcgcaag ggtatcatgg cggacgaccg 11820 
gtgatccatg cggttaccgc ccgcgtgtcg 11880 
gagtgctcct tttggcttcc ttccaggcgc 119^0 
tggccgcgcg cagcgtaagc ggttaggctg 12000 
tgtagccgga gggttatttt ccaagggttg 12060 
ccggccggac tgcggcgaac gggggtttgc 12120 
ttcctccgga aacagggacg agcccctttt 12180 
gcagatgcgc ccccctcctc agcagcggca 12240 
accctcccct cctcctaccg cgtcaggagg 12300 
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ggcgacatcc gcggttgacg cggcagcaga tggtgattac gaacccccgc ^5^9--^^!^ ^^360 
l^gjcactac ctggacttgg aggagggcga gsg-tggcg cggctaggag -g^-^ctcc ^^^^^ 
tgagcggtac ccaagggtgc agctgaagcg tgatacgcgt g^g^^^tacg ^^^^^ 
gaalcigttt cgcgaccgcg agggagagga g^ccgaggag ^^gcgggatc g | ^2600 

v:^^ ssis fsn n fnkcH \\^'^ 

s i b b?S?isi^2;s 

gccogagggc cgctggctgc ^cgatttgat --^^-^octg =^g^g g ^3020 
gcgoagcttg ^^^^'^HHl ^ataccaflc cccttacgtt ccoatagaca aggaggtaaa 13080 

sSi iSiSli Sgs sag gglS ii: 
Silri SS gl liii Si5 -SnSi iiS 

cggcggcgtg g-^^^J^^^ ga?ga?gcaa gacgcaacgg acclggcggt gcgggcggcg 13500 
agcggtgatg tttctgatca g^^gacgc ^ ^ ^ ggcgccaggt catggaccgc 13560 

~ Si lii rISS iSi iEii 

Si iiii Is iii isis isiii 

cgggcoo tgggctcc.t 5g"|0«t. 'J^l^^t'^ Jc?,cggct Ltggtg.ct 13980 
gtgocgcggg 5"«9J99; ,g,ce'g'cl atttjttoo. gacc.gt.g. 14040 

ii%T.iiii -SIS , - inir^m ii«s 

sig= £ s ~ s ir.c;rg^:= 1^.11 

miii'^n i5?sr4? s ~ 

^g^ii^ii?^ i«r.?r4i s ™ 

atgcgcgacg gggtaacgcc cagcgtggcg ^^ggacatga ^ g g ^^^^ gcatcgcgcg 14580 
ggcatgtatg cctcaaaccg g«g"^^^^ gcca?cttga acccgcactg gctaccgccc 14 64 0 
gccgoogtga accccgagta "tcaccaat 9=^^"^^^^ acgatggatt cctctgggac 14700 
cctggtttct acaccggggg attcgaggtg «=^^g*ggy" t-octaaactt qcaacagcgc 14760 
gacatagacg acagcgtgtt ttccccgcaa ccgcagaccc ^g^^^^^^" ^ | ^^^^ 

lagcaggcag aggcggcgct gcgaaaggaa «g="ccgca gg^caagcag ^ttg g ^^^^^ 
Itaggcgctg cggccccgcg gtcagatgct ^gtagcccat ^tccaagctt g-tjgg^^^ ^^^^^ 
cttaccagca ctcgcaccao ccgcccgcgc ^tgctgggcg -gg^^^^^^^ caacgggata 15000 
tcgctgctgc agccgcagcg cg^f^^^^^^ !=aacataca cgcaggagca cagggacgtg 15060 
gagagcctag tggacaagat gagtagatgg «^g^=g^|^9 g?cagcglgg tcl|gtgtgg 15120 
ccaggcccgc gcccgcccac ccgtcgtcaa aggcacgacc |^cag gggg ^g^go 
gaggacgatg actcggcaga cgacagcagc g^^^^ggatt ^ggg ggg 9 ^ 1524 0 

tttgcgcacc ttcgccccag gctggggaga ^tgttttaaa mH^HH ^^Jttlgta 15300 
aataaaaaac tcaccaaggc catggcaccg agcgttggtt "cttgtatt ^ go 

tgcggcgcgc ggcgatgtat g^gg-ggtc ctcctccctc ctacgagagt gtggtgj^g ^^^^^ 
cggcgccagt ggcggcggcg <=tgggttctc ^^ttcgacgc totgagttgg 15480 

tgcctccgcg gtacctgcgg "taccgggg ^^^^^^^^^^ ::igtcaacg gatgtggoat 15540 
cacccctatt cgacaocacc cgtgtgtacc ^gg^ggac ^ aacaatgact 15600 

ccctgaacta ccagaacgac "cagcaact ^tctgaccac gg^^ cactggggcg 15660 

acagcccggg ggaggcaagc a<=^"g^°=^ tacoaaatgt gaacgagttc atgtttacca 15720 
gcgacctgaa aaccatcctg "^accaaca tgcoaaatgt 9 ^J^^^ caggtggagc 15780 
ataagtttaa ggcgcgggtg atggtgtcgc g"^g=" ctactccqag accatgacca 15840 
tgaaatacga gtgggtggag "cacgctgc ^cgagggcaa ctactccgag l^^^^ ^^^.^ 

tagaccttat gaacaacgcg atcgtggagc cttcagactg gggtttgacc 15960 

tisrcH' -4r,rj s»r.ir. .,~ticcat ..o^. 
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ttttgctgcc aggatgcggg gtggacttca 
tccgcaagcg gcaacccttc caggagggct 
gtaacattcc cgcactgttg gatgtggacg 
aacagggcgg gggtggcgca ggcggcagca 
ccaacgcggc agccgcggca atgcagccgg 
gcgacacctt tgccacacgg gctgaggaga 
ctgccgcccc cgctgcgcaa cccgaggtcg 
ccctgacaga ggacagcaag aaacgcagtt 
cccagtaccg cagctggtac cttgcataca 
catggaccct gctttgcact cctgacgtaa 
tgccagacat gatgcaagac cccgtgacct 
cggtggtggg cgccgagctg ttgcccgtgc 
tctactccca actcatccgc cagtttacct 
agaaccagat tttggcgcgc ccgccagccc 
ctgctctcafc agatcacggg acgctaccgc 
tgaccattac tgacgccaga cgccgcacct 
tctcgccgcg cgtcctatcg agccgcactt 
ccagcaataa cacaggctgg ggcctgcgct 
agcgctccga ccaacaccca gtgcgcgtgc 
acaaacgcgg ccgcactggg cgcaccaccg 
aggcgcgcaa ctacacgccc acgccgccac 
ccgtggtgcg cggagcccgg cgctatgcta 
gtcgccaccg ccgccgaccc ggcactgccg 
gcgcacgtcg caccggccga cgggcggcca 
ttgtcactgt gccccccagg tccaggcgac 
gtgctatgac tcagggtcgc aggggcaacg 
tgcgcgtgcc cgtgcgcacc cgccccccgc 
actcgtactg ttgtatgtat ccagcggcgg 
aaatcaaaga agagatgctc caggtcatcg 
aagagcagga ttacaagccc cgaaagctaa 
atgatgaact tgacgacgag gtggaactgc 
agtggaaagg tcgacgcgta aaacgtgttt 
ccggtgagcg ctccacccgc acctacaagc 
acctgcttga gcaggccaac gagcgcctcg 
acatgctggc gttgccgctg gacgagggca 
tgcagcaggt gctgcccgcg cttgcaccgt 
ctggtgactt ggcacccacc gtgcagctga 
tcttggaaaa aatgaccgtg gaacctgggc 
agcaggtggc gccgggactg ggcgtgcaga 
gcaccagtat tgccaccgcc acagagggca 
cggtggcgga tgccgcggtg caggcggtcg 
tgcaaacgga cccgtggatg tttcgcgttt 
agtacggcgc cgccagcgcg ctactgcccg 
cccccggcta tcgtggctac acctaccgcc 
ccaccactgg aacccgccgc cgccgtcgcc 
tgcgcagggt ggctcgcgaa ggaggcagga 
ccagcatcgt ttaaaagccg gtctttgtgg 
tccgtttccc ggtgccggga ttccgaggaa 
acggcctgac gggcggcatg cgtcgtgcgc 
gcatgcgcgg cggtatcctg cccctcctta 
tgcccggaat tgcatccgtg gccttgcagg 
tgtggaaaaa tcaaaataaa aagtctggac 
tgtagaatgg aagacatcaa ctttgcgtct 
atgggaaact ggcaagatat cggcaccagc 
tcgctgtgga gcggcattaa aaatttcggt 
tggaacagca gcacaggcca gatgctgagg 
aaggtggtag atggcctggc ctctggcatt 
gtgcaaaata agattaacag taagcttgat 
gccgtggaga cagtgtctcc agaggggcgt 
gaaactctgg tgacgcaaat agacgagcct 
ctgcccacca cccgtcccat cgcgcccatg 
gtaacgctgg acctgcctcc ccccgccgac 
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cccacagccg cctgagcaac ttgttgggca 16080 
ttaggatcac ctacgatgat ctggagggtg 16140 
cct:accaggc gagcttgaaa gatgacaccg 16200 
acagcagtgg cagcggcgcg gaagagaact 16260 
tggaggacat gaacgatcat gccattcgcg 16320 
agcgcgctga ggccgaagca gcggccgaag 16380 
agaagcctca gaagaaaccg gtgatcaaac 164 40 
acaacctaat aagcaatgac agcaccttca 16500 
actacggcga ccctcagacc ggaatccgct 16560 
cctgcggctc ggagcaggtc tactggtcgt 16620 
tccgctccac gcgccagatc agcaactttc 16680 
actccaagag cttctacaac gaccaggccg 16740 
ctctgaccca cgtgttcaat cgctttcccg 16800 
ccaccatcac caccgtcagt gaaaacgttc 168 60 
tgcgcaacag catcggagga gtccagcgag 16920 
gcccctacgt ttacaaggcc ctgggcatag 16980 
tttgagcaag catgtccatc cttatatcgc 17040 
tcccaagcaa gatgtttggc ggggccaaga 17100 
gcgggcacta ccgcgcgccc tggggcgcgc 17160 
tcgatgacgc catcgacgcg gtggtggagg 17220 
cagtgtccac agtggacgcg gccattcaga 17280 
aaatgaagag acggcggagg cgcgtagcac 17340 
cccaacgcgc ggcggcggcc ctgcttaacc 17400 
tgcgggccgc tcgaaggctg gccgcgggta 17 4 60 
gagcggccgc cgcagcagcc gcggccatta 17520 
tgtattgggt gcgcgactcg gttagcggcc 17 580 
gcaactagat tgcaagaaaa aactacttag 17 64 0 
cggcgcgcaa cgaagctatg tccaagcgca 177 00 
cgccggagat ctatggcccc ccgaagaagg 17 760 
agcgggtcaa aaagaaaaag aaagatgatg 17820 
tgcacgctac cgcgcccagg cgacgggtac 17880 
tgcgacccgg caccaccgta gtctttacgc 17 940 
gcgtgtatga tgaggtgtac ggcgacgagg 18000 
gggagtttgc ctacggaaag cggcataagg 18060 
acccaacacc tagcctaaag cccgtaacac 18120 
ccgaagaaaa gcgcggccta aagcgcgagt 18180 
tggtacccaa gcgccagcga ctggaagatg 18240 
tggagcccga ggtccgcgtg cggccaatca 18300 
ccgtggacgt tcagataccc actaccagta 18360 
tggagacaca aacgtccccg gttgcctcag 18420 
ctgcggccgc gtccaagacc tctacggagg 18 480 
cagccccccg gcgcccgcgc ggttcgagga 18540 
aatatgccct acatccttcc attgcgccta 18 600 
ccagaagacg agcaactacc cgacgccgaa 18 660 
gtcgccagcc cgtgctggcc ccgatttccg 18720 
ccctggtgct gccaacagcg cgctaccacc 18780 
ttcttgcaga tatggccctc acctgccgcc 18840 
gaatgcaccg taggaggggc atggccggcc 18 900 
accaccggcg gcggcgcgcg tcgcaccgtc 18960 
ttccactgat cgccgcggcg attggcgccg 19020 
cgcagagaca ctgattaaaa acaagttgca 19080 
tctcacgctc gcttggtcct gtaactattt 19140 
ctggccccgc gacacggctc gcgcccgttc 19200 
aatatgagcg gtggcgcctt cagctggggc 19260 
tccaccgtta agaactatgg cagcaaggcc 19320 
gataagttga aagagcaaaa tttccaacaa 19380 
agcggggtgg tggacctggc caaccaggca 19440 
ccccgccctc ccgtagagga gcctccaccg 19500 
ggcgaaaagc gtccgcgccc cgacagggaa 19560 
ccctcgtacg aggaggcact aaagcaaggc 19620 
gctaccggag tgctgggcca gcacacaccc 19680 
acccagcaga aacctgtgct gccaggcccg 19740 
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accgccgttg ttgtaacccg tcctagccgc 
cgatcgttgc ggcccgtagc cagtggcaac 
ctgggggtgc aatccctgaa gcgccgacga 
tgtcatgtat gcgtccatgt cgccgccaga 
ccaagatggc taccccttcg atgatgccgc 
acgcctcgga gtacctgagc cccgggctgg 
tcagcctgaa taacaagttt agaaacccca 
accggtccca gcgtttgacg ctgcggttca 
cgtacaaggc gcggttcacc ctagctgtgg 
cgtactttga catccgcggc gtgctggaca 
ctgcctacaa cgccctggct cccaagggtg 
ctactgctct tgaaataaac ctagaagaag 
agcaagctga gcagcaaaaa actcacgtat 
ttacaaagga gggtattcaa ataggtgtcg 
catttcaacc tgaacctcaa ataggagaat 
cagctgggag agtccttaaa aagactaccc 
aacccacaaa tgaaaatgga gggcaaggca 
aaagtcaagt ggaaatgcaa tttttctcaa 
acttgactcc taaagtggta ttgtacagtg 
atatttctta catgcccact attaaggaag 
ctatgcccaa caggcctaat tacattgctt 
acaacagcac gggtaatatg ggtgttctgg 
tagatttgca agacagaaac acagagcttt 
atagaaccag gtacttttct atgtggaatc 
gaattattga aaatcatgga actgaagatg 
gtgtgattaa tacagagact cttaccaagg 
gggaaaaaga tgctacaga.a ttttcagata 
ccatggaaat caatctaaat gccaacctgt 
tgtatttgcc cgacaagcta aagrtacagtc 
acacctacga ctacatgaac aagcgagtgg 
accttggagc acgctggtcc cttgactata 
gcaatgctgg cctgcgctac cgctcaatgt 
acatccaggt gcctcagaag ttctttgcca 
acacctacga gtggaacttc aggaaggatg 
atgacctaag ggttgacgga gccagcatta 
tcttccccat ggcccacaac accgcctcca 
acgaccagtc ctttaacgac tatctctccg 
acgctaccaa cgtgcccata tccatcccct 
ccttcacgcg ccttaagact aaggaaaccc 
acacctactc tggctctata ccctacctag 
agaaggtggc cattaccttt gactcttctg 
cccccaacga gtttgaaatt aagcgctcag 
gtaacatgac caaagactgg ttcctggtac 
agggcttcta tatcccagag agctacaagg 
agcccatgag ccgtcaggtg gtggatgata 
tcctacacca acacaacaac tctggatttg 
gacaggccta ccctgctaac ttcccctatc 
ttacccagaa aaagtttctt tgcgatcgca 
ttatgtccat gggcgcactc acagacctgg 
acgcgctaga catgactttt gaggtggatc 
tgtttgaagt ctttgacgtg gtccgtgtgc 
tgtacctgcg cacgcccttc tcggccggca 
aacaacagct gccgccatgg gctccagtga 
tggttgtggg ccatattttt tgggcaccta 
acacaagctc gcctgcgcca tagtcaatac 
gatggccttt gcctggaacc cgcactcaaa 
ttctgaccag cgactcaagc aggtttacca 
cgccattgct tcttcccccg accgctgtat 
ggggcccaac tcggccgcct gtggactatt 
ctggccccaa actcccatgg atcacaaccc 
ctccatgctc aacagtcccc aggtacagcc 
cagcttcctg gagcgccact cgccctactt 
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gcgtccctgc gccgcgccgc cagcggtccg 19800 

tggcaaagca cactgaacag catcgtgggt 198 60 

tgcttctgaa tagctaacgt gtcgtatgtg 19920 

ggagctgctg agccgccgcg cgcccgcttt 19980 

agtggtctta catgcacatc tcgggccagg 2004 0 

tgcagtttgc ccgcgccacc gagacgtact 20100 

cggtggcgcc tacgcacgac gtgaccacag 20160 

tccctgtgga ccgtgaggat actgcgtact 20220 

gtgataaccg tgtgctggac atggcttcca 2028 0 

ggggccctac ttttaagccc tactctggca 2034 0 

ccccaaatcc ttgcgaatgg gatgaagctg 204 00 

aggacgatga caacgaagac gaagtagacg 204 60 

ttgggcaggc gccttattct ggtataaata 2052 0 

aaggtcaaac acctaaatat gccgataaaa 2058 0 

ctcagtggta cgaaactgaa attaatcatg 2064 0 

caatgaaacc atgttacggt tcatatgcaa 20700 

ttcttgtaaa gcaacaaaat ggaaagctag 20760 

ctactgaggc gaccgcaggc aatggtgata 20820 

aagatgtaga tatagaaacc ccagacactc 208 8 0 

gtaactcacg agaactaatg ggccaacaat 2094 0 

ttagggacaa ttttattggt ctaatgtatt 21000 

cgggccaagc atcgcagttg aatgctgttg 21060 

cataccagct tttgcttgat tccattggtg 21120 

aggctgttga cagctatgat ccagatgtta 2118 0 

aacttccaaa ttactgcttt ccactgggag 21240 

taaaacctaa aacaggtcag gaaaatggat 21300 

aaaatgaaat aagagttgga aataattttg 21360 

ggagaaattt cctgtactcc aacatagcgc 21420 

cttccaacgt aaaaatttct gataacccaa 21480 

tggctcccgg gttagtggac tgctacatta 2154 0 

tggacaacgt caacccattt aaccaccacc 21600 

tgctgggcaa tggtcgctat gtgcccttcc 21660 

ttaaaaacct ccttctcctg ccgggctcat 21720 

ttaacatggt tctgcagagc tccctaggaa 21780 

agtttgatag catttgcctt tacgccacct 21840 

cgcttgaggc catgcttaga aacgacacca 21900 

ccgccaacat gctctaccct atacccgcca 21960 

cccgcaactg ggcggctttc cgcggctggg 22020 

catcactggg ctcgggctac gacccttatt 22080 

atggaacctt ttacctcaac cacaccttta 22140 

tcagctggcc tggcaatgac cgcctgctta 22200 

ttgacgggga gggttacaac gttgcccagt 222 60 

aaatgctagc taactacaac attggctacc 22320 

accgcatgta ctccttcttt agaaacttcc 22380 

ctaaatacaa ggactaccaa caggtgggca 224 4 0 

ttggctacct tgcccccacc atgcgcgaag 22500 

cgcttatagg caagaccgca gttgacagca 22560 

ccctttggcg catcccattc tccagtaact 22 620 

gccaaaacct tctctacgcc aactccgccc 22680 

ccatggacga gcccaccctt ctttatgttt 227 4 0 

accggccgca ccgcggcgtc atcgaaaccg 22 800 

acgccacaac ataaagaagc aagcaacatc 22860 

gcaggaactg aaagccattg tcaaagatct 22920 

tgacaagcgc tttccaggct ttgtttctcc 22980 

ggccggtcgc gagactgggg gcgtacactg 23040 

aacatgctac ctctttgagc cctttggctt 2310,0 

gtttgagtac gagtcactcc tgcgccgtag 23160 

aacgctggaa aagtccaccc aaagcgtaca 23220 

ctgctgcatg tttctccacg cctttgccaa 23280 

caccatgaac cttattaccg gggtacccaa 2334 0 

caccctgcgt cgcaaccagg aacagctcta 234 00 

ccgcagccac agtgcgcaga ttaggagcgc 234 60 
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cacttctttt tgtcacttga aaaacatgta 
aggcaaatgc ttttatttgt acactctcgg 
cgccgtttaa aaatcaaagg ggttctgccg 
gttgcgatac tggtgtttag tgctccactt 
ggtgaagttt tcactccaca ggctgcgcac 
cgatatcttg aagtcgcagt tggggcctcc 
gttgcagcac tggaacacta tcagcgccgg 
ggagatcaga tccgcgtcca ggtcctccgc 
tagctgcctt cccaaaaagg gcgcgtgccc 
catcaaaagg tgaccgtgcc cggtctgggc 
gatctgctta aaagccacct gagcctttgc 
gccggaaaac tgattggccg gacaggccgc 
ggagatctgc accacatttc ggccccaccg 
ctccttcagc gcgcgctgcc cgttttcgct 
atttatcata' atgcttccgt gtagacactt 
cagccacaac gcgcagcccg tgggctcgtg 
caggtacgcc tgcaggaatc gccccatcat: 
cagctgcaac ccgcggtgct cctcgttcag 
cacttggtca ggcagtagtt tgaagttcgc 
catcagcgcg cgcgcagcct ccatgccctt 
cgggttcatc accgtaattt cactttccgc 
ccgcatacca cgcgccactg ggtcgtcttc 
tttgccatgc ttgattagca ccggtgggtt 
ttctctttct tcctcgctgt ccacgattac 
agaagggcgc ttctttttct tcttgggcgc 
ccgcgggctg ggtgtgcgcg gcaccagcgc 
ctcgatacgc cgcctcatcc gcttttttgg 
cggggacgac acgtcctcca tggttggggg 
ggtggtttcg cgctgctcct cttcccgact 
gatcatggag tcagtcgaga agaaggacag 
cgcctccacc gatgccgcca acgcgcctac 
ggaggaggaa gtgattatcg agcaggaccc 
ctcagtacca acagaggata aaaagcaaga 
agtcgggcgg ggggacgaaa ggcatggcga 
gaagcatctg cagcgccagt gcgccattat 
gcccctcgcc atagcggatg tcagccttgc 
accccccaaa cgccaagaaa acggcacatg 
cgtatttgcc gtgccagagg tgcttgccac 
acccctatcc tgccgtgcca accgcagccg 
cgctgtcata cctgatatcg cctcgctcaa 
acgcgacgag aagcgcgcgg caaacgctct 
ctctggagtg ttggtggaac tcgagggtga 
catcgaggtc acccactttg cctacccggc 
agtcatgagt gagctgatcg tgcgccgtgc 
agaacaaaca gaggagggcc tacccgcagt 
aacgcgcgag cctgccgact tggaggagcg 
taccgtggag cttgagtgca tgcagcggtt 
agaggaaaca ttgcactaca cctttcgaca 
caacgtggag ctctgcaacc tggtctccta 
gcaaaacgtg cttcattcca cgctcaaggg 
cgtttactta tttctatgct acacctggca 
ggaggagtgc aacctcaagg agctgcagaa 
gacggccttc aacgagcgct ccgtggccgc 
cctgcttaaa accctgcaac agggtctgcc 
ctttaggaac tttatcctag agcgctcagg 
tagcgacttt gtgcccatta agtaccgcga 
ccttctgcag ctagccaact accttgccta 
tgacggtcta ctggagtgtc actgtcgctg 
ttgcaattcg cagctgctta acgaaagtca 
ctcgcctgac gaaaagtccg cggctccggg 
ggcttacctt cgcaaatttg tacctgagga 
agaccaatcc cgcccgccaa atgcggagct 



-29- 

aaaataatgt actagagaca ctttcaataa 23520 
gtgattattt acccccaccc ttgccgtctg 23580 
cgcatcgcta tgcgccactg gcagggacac 23640 
aaactcaggc acaaccatcc gcggcagctc 23700 
catcaccaac gcgtttagca ggtcgggcgc 237 60 
gccctgcgcg cgcgagttgc gatacacagg 23820 
gtggtgcacg ctggccagca cgctcttgtc 23880 
gttgctcagg gcgaacggag tcaactttgg 23940 
aggctttgag ttgcactcgc accgtagtgg 24000 
gttaggatac agcgcctgca taaaagcctt 24060 
gccttcagag aagaacatgc cgcaagactt 24120 
gtcgtgcacg cagcaccttg cgtcggtgtt 24180 
gttcttcacg atcttggcct tgctagactg 2424 0 
cgtcacatcc atttcaatca cgtgctcctt 24 300 
aagctcgcct tcgatctcag cgcagcggtg 24360 
atgcttgtag gtcacctctg caaacgactg 24420 
cgtcacaaag gtcttgttgc tggtgaaggt 24480 
ccaggtcttg catacggccg ccagagcttc 24540 
ctttagatcg ttatccacgt ggtacttgtc 24 600 
ctcccacgca gacacgatcg gcacactcag 24 660 
ttcgctgggc tcttcctctt cctcttgcgt 24720 
attcagccgc cgcactgtgc g^cttacctcc 24780 
gctgaaaccc accatttgta gcgccacatc 24840 
ctctggtgat ggcgggcgct cgggcttggg 24 900 
aatggccaaa tccgccgccg aggtcgatgg 24 960 
gtcttgtgat gagtcttcct cgtcctcgga 25020 
gggcgcccgg ggaggcggcg gcgacgggga 25080 
acgtcgcgcc gcaccgcgtc cgcgctcggg 25140 
ggccatttcc ttctcctata ggcagaaaaa 25200 
cctaaccgcc ccctctgagt tcgccaccac 25260 
caccttcccc gtcgaggcac ccccgcttga 25320 
aggttttgta agcgaagacg acgaggaccg 25380 
ccaggacaac gcagaggcaa acgaggaaca 254 40 
ctacctagat gtgggagacg acgtgctgtt 25500 
ctgcgacgcg ttgcaagagc gcagcgatgt 25560 
ctacgaacgc cacctattct caccgcgcgt 25620 
cgagcccaac ccgcgcctca acttctaccc 25680 
ctatcacatc tttttccaaa actgcaagat 25740 
agcggacaag cagctggcct tgcggcaggg 25800 
cgaagtgcca aaaatctttg agggtcttgg 25860 
gcaacaggaa aacagcgaaa atgaaagtca 25920 
caacgcgcgc ctagccgtac taaaacgcag 25980 
acttaaccta ccccccaagg tcatgagcac 26040 
gcagcccctg gagagggatg caaatttgca 26100 
tggcgacgag cagctagcgc gctggcttca 26160 
acgcaaacta atgatggccg cagtgctcgt 26220 
ctttgctgac ccggagatgc agcgcaagct 26280 
gggctacgta cgccaggcct gcaagatctc 26340 
ccttggaatt ttgcacgaaa accgccttgg 26400 
cgaggcgcgc cgcgactacg tccgcgactg 264 60 
gacggccatg ggcgtttggc agcagtgctt 26520 
actgctaaag caaaacttga aggacctatg 26580 
gcacctggcg gacatcattt tccccgaacg 2664 0 
agacttcacc agtcaaagca tgttgcagaa 26700 
aatcttgccc gccacctgct gtgcacttcc 267 60 
atgccctccg ccgctttggg gccactgcta 268 20 
ccactctgac ataatggaag acgtgagcgg 26880 
caacctatgc accccgcacc gctccctggt 26940 
aattatcggt acctttgagc tgcagggtcc 27000 
gttgaaactc actccggggc tgtggacgtc 270 60 
ctaccacgcc cacgagatta ggttctacga 27120 
taccgcctgc gtcattaccc agggccacat 27180 
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tcttggccaa ttgcaagcca tcaacaaagc ccgccaagag tttctgctac gaaagggacg 27240 
gggggtttac ttggaccccc agtccggcga ggagctcaac ccaatccccc cgccgccgca 27300 
gccctatcag cagcagccgc gggcccttgc ttcccaggat ggcacccaaa aagaagctgc 27360 
agctgccgcc gccacccacg gacgaggagg aatactggga cagtcaggca gaggaggttt 27 420 
tggacgagga ggaggaggac atgatggaag actgggagag cctagacgag gaagcttccg 27 4 80 
aggtcgaaga ggtgtcagac gaaacaccgt caccctcggt cgcattcccc tcgccggcgc 27540 
cccagaaatc ggcaaccggt tccagcatgg ctacaacctc cgctcctcag gcgccgccgg 27600 
cactgcccgt tcgccgaccc aaccgtagat gggacaccac tggaaccagg gccggtaagt 27660 
ccaagcagcc gccgccgtta gcccaagagc aacaacagcg ccaaggctac cgctcatggc 27720 
gcgggcacaa gaacgccata gttgcttgct tgcaagactg tgggggcaac atctccttcg 27780 
cccgccgctt tcttctctac catcacggcg tggccttccc ccgtaacatc ctgcattact 27840 
accgtcatct ctacagccca tactgcaccg gcggcagcgg cagcggcagc aacagcagcg 27900 
gccacacaga agcaaaggcg accggatagc aagactctga caaagcccaa gaaatccaca 27960 
gcggcggcag cagcaggagg aggagcgctg cgtctggcgc ccaacgaacc cgtatcgacc 28020 
cgcgagctta gaaacaggat ttttcccact ctgtatgcta tatttcaaca gagcaggggc 28080 
caagaacaag agctgaaaat aaaaaacagg tctctgcgat ccctcacccg cagctgcctg 28140 
tatcacaaaa gcgaagatca gcttcggcgc acgctggaag acgcggaggc tctcttcagt 28200 
aaatactgcg cgctgactct taaggactag tttcgcgccc tttctcaaat ttaagcgcga 28260 
aaactacgtc atctccagcg gccacacccg gcgccagcac ctgtcgtcag cgccattatg 28320 
agcaaggaaa ttcccacgcc ctacatgtgg agttaccagc cacaaatggg acttgcggct 28380 
ggagctgccc aagactactc aacccgaata aactacatga gcgcgggacc ccacatgata 28440 
tcccgggtca acggaatccg cgcccaccga aaccgaattc tcttggaaca ggcggctatt 2B500 
accaccacac ctcgtaataa ccttaatccc cgtagttggc ccgctgccct ^gtgtaccag 28560 
gaaagtcccg ctcccaccac tgtggtactt cccagagacg cccaggccga agttcagatg 28 620 
actaactcag gggcgcagct tgcgggcggc tttcgtcaca gggtgcggtc gcccgggcag 28 680 
ggtataactc acctgacaat cagagggcga ggtattcagc tcaacgacga gtcggtgagc 287 40 
tcctcgcttg gtctccgtcc ggacgggaca tttcagatcg gcggcgccgg ccgtccttca 28800 
ttcacgcctc gtcaggcaat cctaactctg cagacctcgt cctctgagcc gcgctctgga 288 60 
ggcattggaa ctctgcaatt tattgaggag tttgtgccat cggtctactt taaccccttc 28920 
tcgggacctc ccggccacta tccggatcaa tttattccta actttgacgc ggtaaaggac 28980 
tcggcggacg gctacgactg aatgttaagt ggagaggcag agcaactgcg cctgaaacac 29040 
ctggtccact gtcgccgcca caagtgcttt gcccgcgact ccggtgagtt ttgctacttt 29100 
gaattgcccg aggatcatat cgagggcccg gcgcacggcg tccggcttac cgcccaggga 29160 
gagcttgccc gtagcctgat tcgggagttt acccagcgcc ccctgctagt tgagcgggac 29220 
aggggaccct gtgttctcac tgtgatttgc aactgtccta accttggatt acatcaagat 29280 
ttaattaatt gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaatcgt 29340 
ttgtgttatg tttcaacgtg tttatttttc aattgcagaa aatttcaagt catttttcat 29400 
tcagtagtat agccccacca ccacatagct tatacagatc accgtacctt aatcaaactc 294 60 
acagaaccct agtattcaac ctgccacctc cctcccaaca cacagagtac acagtccttt 29520 
ctccccggct ggccttaaaa agcatcatat catgggtaac agacatattc ttaggtgtta 29580 
tattccacac ggtttcctgt cgagccaaac gctcatcagt gatattaata aactccccgg 29640 
gcagctcact taagttcatg tcgctgtcca gctgctgagc cacaggctgc tgtccaactt 29700 
gcggttgctt aacgggcggc gaaggagaag tccacgccta catgggggta gagtcataat 29760 
cgtgcatcag gatagggcgg tggtgctgca gcagcgcgcg aataaactgc tgccgccgcc 29820 
gctccgtcct gcaggaatac aacatggcag tggtctcctc agcgatgatt cgcaccgccc 29880 
gcagcataag gcgccttgtc ctccgggcac agcagcgcac cctgatctca cttaaatcag 29940 
cacagtaact gcagcacagc accacaatat tgttcaaaat cccacagtgc aaggcgctgt 30000 
atccaaagct catggcgggg accacagaac ccacgtggcc atcataccac aagcgcaggt 30060 
agattaagtg gcgacccctc ataaacacgc tggacataaa cattacctct tttggcatgt 30120 
tgtaattcac cacctcccgg taccatataa acctctgatt aaacatggcg ccatccacca 30180 
ccatcctaaa ccagctggcc aaaacctgcc cgccggctat acactgcagg gaaccgggac 30240 
tggaacaatg acagtggaga gcccaggact cgtaaccatg gatcatcatg ctcgtcatga 30300 
tatcaatgtt ggcacaacac aggcacacgt gcatacactt cctcaggatt acaagctcct 30360 
cccgcgttag aaccatatcc cagggaacaa cccattcctg aatcagcgta aatcccacac 30420 
tgcagggaag acctcgcacg taactcacgt tgtgcattgt caaagtgtta cattcgggca 30480 
gcagcggatg atcctccagt atggtagcgc gggtttctgt ctcaaaagga ggtagacgat 30540 
ccctactgta cggagtgcgc cgagacaacc gagatcgtgt tggtcgtagt gtcatgccaa 3060D 
atggaacgcc ggacgtagtc atatttcctg aagcaaaacc aggtgcgggc gtgacaaaca 30660 
gatctgcgtc tccggtctcg ccgcttagat cgctctgtgt agtagttgta gtatatccac 30720 
tctctcaaag catccaggcg ccccctggct tcgggttcta tgtaaactcc ttcatgcgcc 30780 
gctgccctga taacatccac caccgcagaa taagccacac ccagccaacc tacacattcg 30840 
ttctgcgagt cacacacggg aggagcggga agagctggaa gaaccatgtt ttttttttta 30900 
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ttccaaaaga 
ggtggcgtgg 
aatggcttcc 
agggtgaatc 
tcgccacctt 
aatctgctcc 
tcaggttcct 
tcccgtaggt 
gcggccactt 
ctcggagcta 
aaaatigcaag 
gtagtcatgc 
catttttctc 
aacatttaaa 
cggactacgg 
accgacagct 
tgattcatcg 
gcgtagagac 
cacataaaca 
aacatacagc 
tatitaaaaaa 
caagtgcaga 
aacacccaga 
ttcctcaaat 
caattcccaa 
cgccccgcgc 
aaggtatatt 



ttatccaaaa 
tcaaactcta 
aaaaggcaaa 
tcctctataa 
ctcaatatat 
agagcgccct 
cacagacctg 
cccttcgcag 
ccccgccagg 
tgctaaccag 
gtgctgctca 
tcatgcagat 
tcaaacatgt 
cattagaagc 
ccatgccggc 
cctcggtcat 
gtcagtgcta 
aacattacag 
cctgaaaaac 
gcttcacagc 
acaccactcg 
gcgagtatat 
aaaccgcacg 
cgtcacttcc 
cacatacaag 
cacgtcacaa 
attgatgatg 



cctcaaaatg 
cagccaaaga 
cggccctcac- 
acattccagc 
ctctaagcaa 
ccaccttcag 
tataagattc 
ggccagctga 
aaccttgaca 
cgtagccccg 
aaaaatcagg 
aaaggcaggt 
ctgcgggttt 
ctgtcttaca 
gtgaccgtaa 
gtccggagtc 
aaaagcgacc 
cccccatagg 
cctcctgcct 
ggcagcctaa 
acacggcacc 
ataggactaa 
cgaacctacg 
gttttcccac 
ttactccgcc 
actccacccc 



aagatctatt 
acagataatg 
gtccaagtgg 
accttcaacc 
atcccgaata 
cctcaagcag 
aaaagcggaa 
acataatcgt 
aaagaaccca 
atgtaagctt 
caaagcctcg 
aagctccgga 
ctgcataaac 
acaggaaaaa 
aaaaactggt 
ataatgtaag 
gaaatagccc 
aggtataaca 
aggcaaaatia 
cagtcagcct 
agctcaatca 
aaaatgacgt 
cccagaaacg 
gttacgtaac 
ctaaaaccta 
ctcattatca 



aagtgaacgc 
gcatttgtaa 
acgtaaaggc 
atgcccaaat 
ttaagtccgg 
cgaatcatga 
cattaacaaa 
gcaggtctgc 
cactgattat 
tgttgcatgg 
cgcaaaaaag 
accaccacag 
acaaaataaa 
caacccttat 
caccgtgatt 
actcggtaaa 
gggggaatac 
aaattaatag 
gcaccctccc 
taccagtaaa 
gtcacagtgt 
aacggttaaa 
aaagccaaaa 
ttcccatttt 
cgtcacccgc 
tattggcttc 



gctcccctcc 
gatgttgcac 
taaacccttc 
aattctcatc 
ccattgtaaa 
ttgcaaaaat 
aataccgcga 
acggaccagc 
gacacgcata 
gcggcgatat 
aaagcacatc 
aaaaagacac 
ataacaaaaa 
aagcataaga 
aaaaagcacc 
cacatcaggt 
atacccgcag 
gagagaaaaa 
gctccagaac 
aaagaaaacc 
aaaaaagggc 
gtccacaaaa 
^acccacaac 
aagaaaacta 
cccg ttccca 
aatccaaaat 



30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32480 



<210> 28 
<211> 25 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; primer 



<400> 28 

ctcaacaatt gtggatccgt actcc 



25 



<210> 29 
<211> 25 
<212> DMA 

<213> /artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 29 

gtgctcagca gatcttgcga ctgtg 25 

<210> 30 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 30 
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ggcgcgttcg gatccactct cttcc 



25 



<210> 31 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 31 

ctacatgcta ggcagatctc gttcggag 



28 



<210> 32 
<211> 1240 
<212> DNA 
<213> adenovirus 



<400> 32 

ggatccactc 

ctgaaaagcg 

gatattcacc 

gacaatcttt 

cttggcgatg 

gtttagctgc 

gtcgggcacc 

ggctacctct 

gaatggcggt 

cccgggcagc 

ctgccatgcg 

ggggtgggtg 

gagtattcca 

gtatagttcg 

tgctcggaag 

gaagacgttg 

gtcgcgcagc 

ggtttccttg 

gacaaactct 

agatccgtac 

tctcgagaaa 



tcttccgcat 
ggcatgactt 
tggcccgcgg 
ttgttgtcaa 
gagcgcaggg 
acgtattcgc 
aggtgcacgc 
ccgcgtaggc 
agggggtcta 
aggcgcgcgt 
cgggcggcaa 
agcgcggagg 
agatatgtag 
tgcgagggag 
actatctgcc 
aagctggcgt 
ttgttgacca 
atgatgtcat 
tcgcggtctt 
tccgccgccg 
ggcgtctaac 



cgctgtctgc 
ctgcgctaag 
tgatgccttt 
gcttggtggc 
tttggttttt 
gcgcaacgca 
gccaaccgcg 
gctcgttggt 
gctgcgtctc 
cgaagtagtc 
gcgcgcgctc 
cgtacatgcc 
ggtagcatct 
cgaggaggtc 
tgaagatggc 
ctgtgagacc 
gctcggcggt 
acttatcctg 
tccagtactc 
agggacctga 
cagtcacagt 



gagggccagc 
attgtcagtt 
gagggtggcc 
aaacgacccg 
gtcgcgatcg 
ccgccattcg 
gttgtgcagg 
ccagcagagg 
gtccgggggg 
tatcttgcat 
gtatgggttg 
gcaaatgtcg 
tccaccgcgg 
gggaccgagg 
atgtgagttg 
taccgcgtca 
gacctgcacg 
tccctttttt 
ttggatcgga 
gcgagtccgc 
cgcaagatct 



tgttggggtg 
tccaaaaacg 
gcatccatct 
tagagggcgt 
gcgcgctcct 
ggaaagacgg 
gtgacaaggt 
cggccgccct 
tctgcgtcca 
ccttgcaagt 
agtgggggac 
taaacgtaga 
atgctggcgc 
ttgctacggg 
gatgatatgg 
cgcacgaagg 
tctagggcgc 
ttccacagct 
aacccgtcgg 
atcgaccgga 



^gtactccct 
^ggaggattt 
ggtcagaaaa 
tggacagcaa 
tggccgcgat 
tggtgcgctc 
caacgctggt 
tgcgcgagca 
cggtaaagac 
ctagcgcctg 
cccatggcat 
ggggctctct 
gcacgtaatc 
cgggctgctc 
ttggacgctg 
aggcgtagga 
agtagtccag 
cgcggttgag 
cctccgaacg 
tcggaaaacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1240 



<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 33 

ggcgcgttcg gatccactct cttcc 



25 



<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: primer 
<400> 34 

gggagtagat ctcccaacag 

<210> 35 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

cccttttttt tggatccctc gcgg 



<210> 36 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

ctacatgcta ggcagatctc gttcggag 

<210> 37 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

ctcaacaatt gttggatccg tactcc 



<210> 38 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

gtgctcagca gatcttgcga ctgtg 



<210> 39 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 39 

ggcgcgttcg gatccactct cttcc 



25 



<210> 40 
<211> 28 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 40 

ctacatgcta ggcagatctc gttcggag 



28 



<210> 41 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

cccttttttt tggatccctc gcgg 



24 



<210> 42 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

gtgctcagca gatcttgcga ctgtg 



25 



<210> 43 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 43 

gacggatcgg 

ccgcatagtt 

cgagcaaaat 

ttagggttag 

gattattgac 

tggagttccg 

cccgcccatt 

attgacgtca 

atcatatgcc 

atgcccagta 

tcgctattac 

actcacgggg 

aaaatcaacg 



gagatctccc 
aagccagtat 
ttaagctaca 
gcgttttgcg 
tagttattaa 
cgttacataa 
gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggactttcca 



gatcccctat 
ctgctccctg 
acaaggcaag 
ctgcttcgcg 
tagtaatcaa 
cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 



ggtcgactct 
cttgtgtgtt 
gcttgaccga 
atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 



cagtacaatc 
ggaggtcgct 
caattgcatg 
cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 



tgctctgatg 60 
gagtagtgcg 120 
aagaatctgc 180 
cgttgacatt 240 
agcccatata 300 
cccaacgacc 360 
gggactttcc 420 
catcaagtgt 480 
gcctggcatt 540 
gtattagtca 600 
tagcggtttg 660 
ttttggcacc 720 
caaatgggcg 780 
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gtaggcgtgt 
ctgcttactg 
gagctcggat 
ctccctctga 
ggatttgata 
agaaaagaca 
cagcaacttg 
cgcgatgttt 
gcgctcgtcg 
gctggtggct 
cgagcagaat 
aaagaccccg 
cgcctgctgc 
tggcatgggg 
ctctctgagt 
gtaatcgtat 
ctgctctgct 
acgctggaag 
gtaggagtcg 
gtccagggtt 
gttgaggaca 
cgaacgagat 
aaaacctctc 
aagaccgtct 
aactgtgcct 
tggggtactc 
caaaatgggc 
cactgtgagc 
cctcacagtt 
caacacactc 
tgccacccaa 
cctcaccacc 
cactggtagc 
actaaagtac 
tggtccaggt 
ttttgattca 
aaacagacgc 
aagactagga 
caaaggcctt 
cactgccaag 
tgaatttggt 
cctagaattt 
cagcacaggt 
accagctcca 
cttaacaaaa 
tttggctcca 
tggagtgcta 
tcttactgaa 
tccaaaatct 
agacaaaact 
cacaactcca 
taatgaaata 
gcggccgctc 
ctcgctgatc 
ccgtgccttc 
aaattgcatc 
acagcaaggg 
tggcttctga 
gcggcgcatt 
gcgccctagc 
ttccccgtca 
acctcgaccc 



acggtgggag 
gcttatcgaa 
ccactctctt 
aaagcgggca 
ttcacctggc 
atctttttgt 
gcgatggagc 
agctgcacgt 
ggcaccaggt 
acctctccgc 
ggcggtaggg 
ggcagcaggc 
catgcgcggg 
tgggtgagcg 
attccaagat 
agttcgtgcg 
cggaagacta 
acgttgaagc 
cgcagcttgt 
tccttgatga 
aactcttcgc 
ccgtactccg 
gagaaaggcg 
gaagatacct 
tttcttactc 
tctttgcgcc 
aacggcctct 
ccacctctca 
acctcagaag 
accatgcaat 
ggacccctca 
accgatagca 
ttgggcattg 
ggggctcctt 
gtgactatta 
caaggcaata 
cttatacttg 
cagggccctc 
tacttgttta 
gggttgatgt 
tcacctaatg 
gattcaaaca 
gccattacag 
tctcctaact 
tgtggcagtc 
atatctggaa 
ctaaacaatt 
ggcacagcct 
cacggtaaaa 
aaacctgtaa 
agtgcatact 
tttgccacat 
gagcatgcat 
agcctcgact 
cttgaccctg 
gcattgtctg 
ggaggattgg 
ggcggaaaga 
aagcgcggcg 
gcccgctcct 
agctctaaat 
caaaaaactt 



gtctatataa 
attaatacga 
ccgcatcgct 
tgacttctgc 
ccgcggtgat 
tgtcaagctt 
gcagggtttg 
attcgcgcgc 
gcacgcgcca 
gtaggcgctc 
ggtctagctg 
gcgcgtcgaa 
cggcaagcgc 
cggaggcgta 
atgtagggta 
agggagcgag 
tctgcctgaa 
tggcgtctgt 
tgaccagctc 
tgtcatactt 
ggtctttcca 
ccgccgaggg 
tctaaccagt 
tcaaccccgt 
ctccctttgt 
tatccgaacc 
ctctggacga 
aaaaaaccaa 
ccctaactgt 
cacaggcccc 
cagtgtcaga 
gtacccttac 
acttgaaaga 
tgcatgtaac 
ataatacttc 
tgcaacttaa 
atgttagtta 
tttttataaa 
cagcttcaaa 
ttgacgctac 
caccaaacac 
aggctatggt 
taggaaacaa 
gtagactaaa 
aaatacttgc 
cagttcaaag 
ccttcctgga 
atacaaacgc 
ctgccaaaag 
cactaaccat 
ctatgtcatt 
cctcttacac 
ctagagggcc 
gtgccttcta 
gaaggtgcca 
agtaggtgtc 
gaagacaata 
accagctggg 
ggtgtggtgg 
ttcgctttct 
cggggcatcc 
gattagggtg 



gcagagctct 
ctcactatag 
gtctgcgagg 
gctaagattg 
gcctttgagg 
ggtggcaaac 
gtttttgtcg 
aacgcaccgc 
accgcggttg 
gttggtccag 
cgtctcgtcc 
gtagtctatc 
gcgctcgtat 
catgccgcaa 
gcatcttcca 
gaggtcggga 
gatggcatgt 
gagacctacc 
ggcggtgacc 
atcctgtccc 
gtactcttgg 
acctgagcga 
cacagtcgca 
gtatccatat 
atcccccaat 
tctagttacc 
ggccggcaac 
gtcaaacata 
ggctgccgcc 
gctaaccgtg 
aggaaagcta 
tatcactgcc 
gcccatttat 
agacgaccta 
cttgcaaact 
tgtagcagga 
tccgtttgat 
ctcagcccac 
caattccaaa 
agccatagcc 
aaatcccctc 
tcctaaacta 
aaataatgat 
tgcagagaaa 
tacagtttca 
tgctcatctt 
cccagaatat 
tgttggattt 
taacattgtc 
tacactaaac 
ttcatgggac 
tttttcatac 
ctattctata 
gttgccagcc 
ctcccactgt 
attctattct 
gcaggcatgc 
gctctagggg 
ttacgcgcag 
tcccttcctt 
ctttagggtt 
atggttcacg 



ctggctaact 
ggagacccaa 
gccagctgtt 
tcagtttcca 
gtggccgcat 
gacccgtaga 
cgatcggcgc 
cattcgggaa 
tgcagggtga 
cagaggcggc 
ggggggtctg 
ttgcatcctt 
gggttgagtg 
atgtcgtaaa 
ccgcggatgc 
ccgaggttgc 
gagttggatg 
gcgtcacgca 
tgcacgtcta 
ttttttttcc 
atcggaaacc 
gtccgcatcg 
agatccaaga 
gacacggaaa 
gggtttcaag 
tccaatggca 
cttacctccc 
aacctggaaa 
gcacctctaa 
cacgactcca 
gccctgcaaa 
tcaccccctc 
acacaaaatg 
aacactttga 
aaagttactg 
ggactaagga 
gctcaaaacc 
aacttggata 
aagcttgagg 
attaatgcag 
aaaacaaaaa 
ggaactggcc 
aagctaactt 
gatgctaaac 
gttttggctg 
attataagat 
tggaacttta 
atgcctaacc 
agtcaagttt 
ggtacacagg 
tggtctggcc 
attgcccaag 
gtgtcaccta 
atctgttgtt 
cctttcctaa 
ggggggtggg 
tggggatgcg 
gtatccccac 
cgtgaccgct 
tctcgccacg 
ccgatttagt 
tagtgggcca 



agagaaccca 840 
gcttggtacc 900 
ggggtgagta 960 
aaaacgagga 1020 
ccatctggtc 1080 
gggcgttgga 1140 
gctccttggc 1200 
agacggtggt 1260 
caaggtcaac 1320 
cgcccttgcg 1380 
cgtccacggt 1440 
gcaagtctag 1500 
ggggacccca 1560 
cgtagagggg 1620 
tggcgcgcac 1680 
tacgggcggg 1740 
atatggttgg 1800 
cgaaggaggc 1860 
gggcgcagta 1920 
acagctcgcg 1980 
cgtcggcctc 2040 
accggatcgg 2100 
tgaagcgcgc 2160 
ccggtcctcc 2220 
agagtccccc 2280 
tgcttgcgct 234 0 
aaaatgtaac 2400 
tatctgcacc 24 60 
tggtcgcggg 2520 
aacttagcat 2580 
catcaggccc 2640 
taactactgc 2700. 
gaaaactagg 2760 
ccgtagcaac 2820 
gagccttggg 2880 
ttgattctca 2940 
aactaaatct 3000 
ttaactacaa 30 60 
ttaacctaag 3120 
gagatgggct 318 0 
ttggccatgg 324 0 
ttagttttga 3300 
tgtggaccac 33 60 
tcactttggt 34 20 
ttaaaggcag 34 80 
ttgacgaaaa 3540 
gaaatggaga 3600 
tatcagctta 3660 
acttaaacgg 3720 
aaacaggaga 3780 
acaactacat 384 0 
aataaaagaa 3900 
aatgctagag 3960 
tgcccctccc 4020 
taaaatgagg 4080 
gtggggcagg 4140 
gtgggctcta 4200 
gcgccctgta 4260 
acacttgcca 4320 
ttcgccggct 4380 
gctttacggc 44 4 0 
tcgccctgat 4500 
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agacggtttt 
aaactggaac 
ggatttcggc 
tctgtggaat 
gtatgcaaag 
cagcaggcag 
taactccgcc 
gactaatttt 
agtagtgagg 
tatccatttt 
atggattgca 
cacaacagac 
cggttctttt 
cgcggctatc 
ctgaagcggg 
ctcaccttgc 
cgcttgatcc 
gtactcggat 
tcgcgccagc 
tcgtgaccca 
gattcatcga 
cccgtgatat 
gtatcgccgc 
gagcgggact 
tttcgattcc 
cggctggatg 
gtttattgca 
agcatttttt 
tgtctgtata 
tgtgtgaaat 
taaagcctgg 
cgctttccag 
gagaggcggt 
ggtcgttcgg 
agaatcaggg 
ccgtaaaaag 
caaaaatcga 
gtttccccct 
cctgtccgcc 
tctcagttcg 
gcccgaccgc 
cttatcgcca 
tgctacagag 
tatctgcgct 
caaacaaacc 
aaaaaaagga 
cgaaaactca 
Gcttttaaat 
tgacagttac 
atccatagtt 
tggccccagt 
aataaaccag 
catccagtct 
gcgcaacgtt 
ttcattcagc 
aaaagcggtt 
atcactcatg 
cttttctgtg 
gagttgctct 
agtgctcatc 
gagatccagt 
caccagcgtt 



tcgccctttg 
aacactcaac 
ctattggtta 
gtgtgtcagt 
catgcatctc 
aagtatgcaa 
catcccgccc 
ttttatttat 
aggctttttt 
cggatctgat 
cgcaggttct 
aatcggctgc 
tgtcaagacc 
gtggctggcc 
aagggactgg 
tcctgccgag 
ggctacctgc 
ggaagccggt 
cgaactgttc 
tggcgatgcc 
ctgtggccgg 
tgctgaagag 
tcccgattcg 
ctggggttcg 
accgccgcct 
atcctccagc 
gcttataatg 
tcactgcatt 
ccgtcgacct 
tgttatccgc 
ggtgcctaat 
tcgggaaacc 
ttgcgtattg 
ctgcggcgag 
gataacgcag 
gccgcgttgc 
cgctcaagtc 
ggaagc t ccc 
tttctccctt 
gtgtaggtcg 
tgcgccttat 
ctggcagcag 
ttcttgaagt 
ctgctgaagc 
accgctggta 
tctcaagaag 
cgttaaggga 
taaaaatgaa 
caatgcttaa 
gcctgactcc 
gctgcaatga 
ccagccggaa 
attaattgtt 
gttgccattg 
tccggttccc 
agctccttcg 
gttatggcag 
actggtgagt 
tgcccggcgt 
attggaaaac 
tcgatgtaac 
tctgggtgag 



acgttggagt 
cctatctcgg 
aaaaatgagc 
tagggtgtgg 
aattagtcag 
agcatgcatc 
ctaactccgc 
gcagaggccg 
ggaggcctag 
caagagacag 
ccggccgctt 
tctgatgccg 
gacctgtccg 
acgacgggcg 
ctgctattgg 
aaagtatcca 
ccattcgacc 
cttgtcgatc 
gccaggctca 
tgcttgccga 
ctgggtgtgg 
cttggcggcg 
cagcgcatcg 
aaatgaccga 
tctatgaaag 
gcggggatct 
gttacaaata 
ctagttgtgg 
ctagctagag 
tcacaattcc 
gagtgagcta 
tgtcgtgcca 
ggcgctcttc 
cggtatcagc 
gaaagaacat 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 
cgggaagcgt 
ttcgctccaa 
ccggtaacta 
ccactggtaa 
ggtggcctaa 
cagttacctt 
gcggtggttt 
atcctttgat 
ttttggtcat 
gttttaaatc 
tcagtgaggc 
ccgtcgtgta 
taccgcgaga 
gggccgagcg 
gccgggaagc 
ctacaggcat 
aacgatcaag 
gtcctccgat 
cactgcataa 
actcaaccaa 
caatacggga 
gttcttcggg 
ccactcgtgc 
caaaaacagg 



ccacgttctt 
tctattcttt 
tgatttaaca 
aaagtcccca 
caaccaggtg 
tcaattagtc 
ccagttccgc 
aggccgcctc 
gcttttgcaa 
gatgaggatc 
gggtggagag 
ccgtgttccg 
gtgccctgaa 
ttccttgcgc 
gcgaagtgcc 
tcatggctga 
accaagcgaa 
aggatgatct 
aggcgcgcat 
atatcatggt 
cggaccgcta 
aatgggctga 
ccttctatcg 
ccaagcgacg 
gttgggcttc 
catgctggag 
aagcaatagc 
tttgtccaaa 
cttggcgtaa 
acacaacata 
actcacatta 
gctgcattaa 
cgcttcctcg 
tcactcaaag 
gtgagcaaaa 
ccataggctc 
aaacccgaca 
tcctgttccg 
ggcgctttct 
gctgggctgt 
tcgtcttgag 
caggattagc 
ctacggctac 
cggaaaaaga 
ttttgtttgc 
cttttctacg 
gagattatca 
aatctaaagt 
acctatctca 
gataactacg 
cccacgctca 
cagaagtggt 
tagagtaagt 
cgtggtgtca 
gcgagttaca 
cgttgtcaga 
ttctcttact 
gtcattctga 
taataccgcg 
gcgaaaactc 
acccaactga 
aaggcaaaat 



taatagtgga 
tgatttataa 
aaaatttaac 
ggctccccag 
tggaaagtcc 
agcaaccata 
ccattctccg 
tgcctctgag 
aaagctcccg 
gtttcgcatg 
gctattcggc 
gctgtcagcg 
tgaactgcag 
agctgtgctc 
ggggcaggat 
tgcaatgcgg 
acatcgcatc 
ggacgaagag 
gcccgacggc 
ggaaaatggc 
tcaggacata 
ccgcttcctc 
ccttcttgac 
cccaacctgc 
ggaatcgttt 
ttcttcgccc 
atcacaaatt 
ctcatcaatg 
tcatggtcat 
cgagccggaa 
attgcgttgc 
tgaatcggcc 
ctcactgact 
gcggtaatac 
ggccagcaaa 
cgcccccctg 
ggactataaa 
accctgccgc 
caatgctcac 
gtgcacgaac 
tccaacccgg 
agagcgaggt 
actagaagga 
gttggtagct 
aagcagcaga 
gggtctgacg 
aaaaggatct 
atatatgagt 
gcgatctgtc 
atacgggagg 
ccggctccag 
cctgcaactt 
agttcgccag 
cgctcgtcgt 
tgatccccca 
agtaagttgg 
gtcatgccat 
gaatagtgta 
ccacatagca 
tcaaggatct 
tcttcagcat 
gccgcaaaaa 



ctcttgttGC 
gggattttgg 
gcgaattaat 
gcaggcagaa 
ccaggctccc 
gtcccgcccc 
ccccatggct 
ctattccaga 
ggagcttgta 
attgaacaag 
tatgactggg 
caggggcgcc 
gacgaggcag 
gacgttgtca 
ctcctgtcat 
cggctgcata 
gagcgagcac 
catcaggggc 
gaggatctcg 
cgcttttctg 
gcgttggcta 
gtgctttacg 
gagttcttct 
catcacgaga 
tccgggacgc 
accccaactt 
tcacaaataa 
tatcttatca 
agctgtttcc 
gcataaagtg 
gctcactgcc 
aacgcgcggg 
cgctgcgctc 
ggttatccac 
aggccaggaa 
acgagcatca 
gataccaggc 
ttaccggata 
gctgtaggta 
cccccgttca 
taagacacga 
atgtaggcgg 
cagtatttgg 
cttgatccgg 
ttacgcgcag 
ctcagtggaa 
tcacctagat 
aaacttggtc 
tatttcgttc 
gcttaccatc 
atttatcagc 
tatccgcctc 
ttaatagttt 
ttggtatggc 
tgttgtgcaa 
ccgcagtgtt 
ccgtaagatg 
tgcggcgacc 
gaactttaaa 
taccgctgtt 
cttttacttt 
agggaataag 



4560 

4 620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920' 

7980 

8040 

8100 

8160 

8220 
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ggcgacacgg aaatgttgaa tactcatact 
tcagggttat tgtctcatga gcggatacat 
aggggttccg cgcacatttc cccgaaaagt 



-37- 

cttccttttt caatattatt gaagcattta 8280 

atttgaatgt atttagaaaa ataaacaaat B340 

gccacctgac gtc 8383 



<210> 44 
<211> 7960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: piasmid 

<400> 44 ^ ^ ^ry 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 120 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 84 0 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 1020 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 114 0 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 1260 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 1380 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 1440 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1500 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 17 40 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 1860 
ccqcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 
caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 2220 
atgacacgga aaccggtcat ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 2400 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 24 60 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 2520 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 264 0 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 27 60 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2820 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2880 
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ctaaagttac tggagecttg ggttttgatt -caaggcaa tatgcaactt -tgtagcag 2940 
gaggactaag gattgattct caaaacagac g^^^tatact ^gatgrcag ^ 30 go 

atgctcaaaa ccaactaaat ctaagactag ^mUl^l^ tacagcttca aacaatLca 3120 
acaacttgga tattaactac aacaaaggcc tttacttgtt ^acagcttc 3^3^ 
aaaagcttga ggttaaccta agcactgcca ^gg^gj^^at |tttgacg ^^^^^^^ 3340 

ccattaatgc aggagatggg cttgaatttg 9"^^°=^'^ claqqctatg gttcctaaac 3300 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa "^g^^tatg HH 33^^ 
taggaactgg ccttagtttt gacagcacag ^tgccattac ^11^ aatgcagaga 3420 

ataagctaac tttgtggacc -accagctc c^tctcctaa ctgtagacta -J^^g^g^ ^^^^ 
aagatgctaa actcactttg ^tcttaacaa ^atgrgg g agtgctcatc 3540 

cagttttggc tgttaaaggc agtttggctc "atatctgg aacag ^^^^^ 3600 

tcagtcaagt ttacttaaac ggagacaaaa ctaaacctgr ^^^^^ ttttcatgqg 3840 

.clfcac g,...c.,g. ^■"•""JJ ^^tlll^^l HI^Ttlt aS«ttSI 3500 

ir.t"i".t '.vx.ir.i iT'^l^'i iitir^ii i%r^^r,i Voti 

,.tc4c=t= ,.«gtgcct tot.gttgoc •9"|t«,t '^tttgccoc 'ccoccgt, 

S^cJ^JSg 1 1^ ,g J»S 

.gggggagg. ttg,g..g.= ""9"9gc •'|«'!|^| clSg'gHc W^gSgoSI "60 
otg.ggo,,. c"?Llctt glc44o<=c «20 

Srogclcll llllllllll I « t . gc =.c,„cg=. g,=«.cccc J3SC 

SlllS^f. :"^^KIlg g ?g««S lSt4'« -Hs«g SISSSII "eS 

^-^s-? ^xs^ "-'^-H ^ JSS 

rJ?l?Sgf f4"-4g^^ ItlSSr. H ^; Jg I 

cgcccatccc gcccctaact ccgcccagtt ccgcccatto ^^^^ caaaaataqt 4980 

t?ttt:tttat ttatgcagag gccgaggccg -tctgcctc tgagctattc -gaagtagt ^^^^ 

gaggaggctt ttttggaggc c:taggctttt ^^^aaaagct ^^^9^9 9 tlggcatagt 5100 

ttttcggatc tgatcagcac gtgttgacaa "aatcatcg 9=^^ g tgct 5160 

ataatacgac aaggtgagga actaaaccat ggccaagttg accagtgccg ^^^^^^ « 5220 

caccgcgcgc galgtlgccg gagcggtcga gttctggacc gaccggctcg g^ttctccog 5220 

^^^^a-^g^ Kfa =c S 3340 

fc^S lllllllllo g^gggggc^g Sg"crclc tglgcgaccc 5460 

Igc^^gcaa? tgc^t^cac? ?c?tggccga ^^agcaggac tgacacgtgc tacgagattt 520 

srcg i b ^"S 

^^^^ ^^^^ gttgfgg? ™- Ull 

Sea ^cfa^?c^^^^^^ ..-^^ 
agcctggggt gcctaatgag ^gagctaact "cattaatt ^^gt g gcgcggggag 6000 
tttccagtcg ggaaacctgt cgtgccagct ^^actd^ y tqcqctcggt 6060 

aggcggtttg cgtattgggc ^ctcttccgc ttcctcgctc actgactcgc H^l^J^^^ ^^^0 
ogttcggctg cggcgagcgg tatcagctoa °tcaaaggcg ^ ccaggaaccg 6180 

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc ^^9= ^^^^^^ 6240. 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ^cccctg g ^ ^ 6300" 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accagg^g^^ ^^^^ 
tocccctgga agctccctcg tgcgctctcc tgttccgacc ^ ^ gtaggtatct 6420 

gtccgccttt ctcccttcgg gaagcgtggc 9^tttctcaa ^g«ca g | g^g^ 
cagttcggtg taggtcgttc gfccaagct g^g-tgtgtg galacgactt 6540 

:?cgcS ^g«~ ""---^ ^--''^^^ ^^^^^^^^^^ 



wo 00/42208 



PCT/EPOO/00265 



tacagagttc ttgaagtggt ggcctaacta 
ctgcgctctg ctgaagccag ttaccttcgg 
acaaaccacc gctggtagcg gtggtttttt 
aaaaggatct caagaagatc ctttgatctt 
aaactcacgt taagggattt tggtcatgag 
tttaaattaa aaatgaagtt ttaaatcaat 
cagttaccaa tgcttaatca gtgaggcacc 
catagttgcc tgactccccg tcgtgtagat 
ccccagtgct gcaatgatac cgcgagaccc 
aaaccagcca gccggaaggg ccgagcgcag 
ccagtctatt aattgttgcc gggaagctag 
caacgttgtt gccattgcta caggcatcgt 
attcagctcc ggttcccaac gatcaaggcg 
agcggttagc tccttcggtc ctccgatcgt 
actcatggtt atggcagcac tgcataattc 
ttctgtgact ggtgagtact caaccaagtc 
ttgctcttgc ccggcgtcaa tacgggataa 
gctcatcatt ggaaaacgtt cttcggggcg 
atccagttcg atgtaaccca ctcgtgcacc 
cagcgtttct gggtgagcaa aaacaggaag 
gacacggaaa tgttgaatac tcatactctt 
gggttattgt ctcatgagcg gatacatatt 
ggttccgcgc acatttcccc gaaaagtgcc 
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cggctacact agaaggacag tatttggtat 6660 
aaaaagagtt ggtagctctt gatccggcaa 6720 
tgtttgcaag cagcagatta cgcgcagaaa 6780 
ttctacgggg tctgacgctc agtggaacga 6B40 
attatcaaaa aggatcttca cctagatcct 6900 
ctaaagtata tatgagtaaa cttggtctga 6960 
tatctcagcg atctgtctat ttcgttcatc 7020 
aactacgata cgggagggct taccatctgg 7080 
acgctcaccg gctccagatt tatcagcaat 7140 
aagtggtcct gcaactttat ccgcctccat 7200 
agtaagtagt tcgccagtta atagtttgcg 7260 
ggtgtcacgc tcgtcgtttg gtatggcttc 7320 
agttacatga tcccccatgt tgtgcaaaaa 7380 
tgtcagaagt aagttggccg cagtgttatc 744 0 
tcttactgtc atgccatccg taagatgctt 7500 
attctgagaa tagtgtatgc ggcgaccgag 7560 
taccgcgcca catagcagaa ctttaaaagt 7 620 
aaaactctca aggatcttac cgctgttgag 7680 
caactgatct tcagcatctt ttactttcac 7740 
gcaaaatgcc gcaaaaaagg gaataagggc 7800 
cctttttcaa tattattgaa gcatttatca 7860 
tgaatgtatt tagaaaaata aacaaatagg 7920 
acctgacgtc * 7960 



<210> 45 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description o 



f Artificial Sequence; primer 



<400> 45 30 
atgggatcca agatgaagcg cgcaagaccg 



<210> 46 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: prxmer 

<400> 46 3Q 
cactatagcg gccgcattct cagtcatctt 



<210> 47 
<211> 7989 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 

iitS? itlililtii iiT^iToil iini^^ =^ct liT^:T.ll |o 

calacaalat ttaagc^Jca acLggcaag gcttgaccga caattgcatg aagaatctgc 180 
t?agggttag gcgt?ttgcg ctgc?Lgcg atgtacgggc cagatatacg cgttgacatt 240 
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gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 300 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 3 60 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 480 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 900 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 1020 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 1140 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 1200 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 1260 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 1380 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 14 4 0 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 1500 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc g^cgcgctcgt 1560 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 1620 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 1680 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 17 4 0 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 1800 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 18 60 
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 204 0 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 
caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 2220 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 2400 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 2460 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 2520 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 2 64 0 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 2760 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2820 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2880 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 2940 
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 3000 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 3060 
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 3120 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag ' 3180 
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 3240 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 3300 
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 3360 
ataagctaac tttgtggacc ggtccaaaac cagaagccaa ctgcataatt gaatacggga 3420 
aacaaaaccc agatagcaaa ctaactttaa tccttgtaaa aaatggagga attgttaatg 3480 
gatatgtaac gctaatggga gcctcagact acgttaacac cttatttaaa aacaaaaatg 354 0 
tctccattaa tgtagaacta tactttgatg ccactggtca tatattacca gactcatctt 3600 
ctcttaaaac agatctagaa ctaaaataca agcaaaccgc tgactttagt gcaagaggtt 3660 
ttatgccaag tactacagcg tatccatttg tccttcctaa tgcgggaaca cataatgaaa 3720 
attatatttt tggtcaatgc tactacaaag caagcgatgg tgcccttttt ccgttggaag 3780 
ttactgttat gcttaataaa cgcctgccag atagtcgcac atcctatgtt atgacttttt 384 0 
tatggtcctt gaatgctggt ctagctccag aaactactca ggcaaccctc ataacctccc 3900 
catttacctt ttcctatatt agagaagatg actgattttt aagaagcggc cgctcgagtc 3960 
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tagagggccc 
tgttgtttgc 
ttcctaataa 
gggtggggtg 
ggatgcggtg 
ggggtatccc 
cagcgtgacc 
ctttctcgcc 
gttccgattt 
acgtagtggg 
ctttaatagt 
ttttgattta 
acaaaaattt 
ccaggctccc 
gtgtggaaag 
gtcagcaacc 
cgcccattct 
ctctgcctct 
caaaaagctc 
taatcatcgg 
gccaagttga 
ttctggaccg 
gtccgggacg 
accctggcct 
gtgtccacga 
tgggggcggg 
gagcaggact 
ggcttcggaa 
ctggagttct 
aatagcatca 
tccaaactca 
gcgtaatcat 
aacatacgag 
acattaattg 
cattaatgaa 
tcctcgctca 
tcaaaggcgg 
gcaaaaggcc 
aggctccgcc 
ccgacaggac 
gttccgaccc 
ctttctcaat 
ggctgtgtgc 
cttgagtcca 
attagcagag 
ggctacacta 
aaaagagttg 
gtttgcaagc 
tctacggggt 
ttatcaaaaa 
taaagtatat 
atctcagcga 
actacgatac 
cgctcaccgg 
agtggtcctg 
gtaagtagtt 
gtgtcacgct 
gttacatgat 
gtcagaagta 
cttactgtca 
ttctgagaat 
accgcgccac 



gtttaaaccc 
ccctcccccg 
aatgaggaaa 
gggcaggaca 
ggctctatgg 
cacgcgccct 
gctacacttg 
acgttcgccg 
agtgctttac 
ccatcgccct 
ggactcttgt 
taagggatt t 
aacgcgaatt 
caggcaggca 
tccccaggct 
atagtcccgc 
ccgccccatg 
gagctattcc 
ccgggagctt 
catagtatat 
ccagtgccgt 
accggctcgg 
acgtgaccct 
gggtgtgggt 
acttccggga 
agttcgccct 
gacacgtgct 
tcgttttccg 
tcgcccaccc 
caaatttcac 
tcaatgtatc 
ggtcatagct 
ccggaagcat 
cgttgcgctc 
tcggccaacg 
ctgactcgct 
taatacggtt 
agcaaaaggc 
cccctgacga 
tataaagata 
tgccgcttac 
gctcacgctg 
acgaaccccc 
acccggtaag 
cgaggtatgt 
gaaggacagt 
gtagctcttg 
agcagattac 
ctgacgctca 
ggatcttcac 
atgagtaaac 
tctgtctatt 
gggagggctt 
ctccagattt 
caactttatc 
cgccagttaa 
cgtcgtttgg 
cccccatgtt 
agttggccgc 
tgccatccgt 
agtgtatgcg 
atagcagaac 



gctgatcagc 
tgccttcctt 
ttgcatcgca 
gcaaggggga 
cttctgaggc 
gtagcggcgc 
ccagcgccct 
gctttccccg 
ggcacctcga 
gatagacggt 
tccaaactgg 
tggggatttc 
aattctgtgg 
gaagtatgca 
ccccagcagg 
ccctaactcc 
gctgactaat 
agaagtagtg 
gtatatccat 
cggcatagta 
tccggtgctc 
gttctcccgg 
gttcatcagc 
gcgcggcctg 
cgcctccggg 
gcgcgacccg 
acgagatttc 
ggacgccggc 
caacttgttt 
aaataaagca 
ttatcatgtc 
gtttcctgtg 
aaagtgtaaa 
actgcccgct 
cgcggggaga 
gcgctcggtc 
atccacagaa 
caggaaccgt 
gcatcacaaa 
ccaggcgttt 
cggatacctg 
taggtatctc 
cgttcagccc 
acacgactta 
aggcggtgct 
atttggtatc 
atccggcaaa 
gcgcagaaaa 
gtggaacgaa 
ctagatcctt 
ttggtctgac 
tcgttcatcc 
accatctggc 
atcagcaata 
cgcctccatc 
tagtttgcgc 
tatggcttca 
gtgcaaaaaa 
agtgttatca 
aagatgcttt 
gcgaccgagt 
tttaaaagtg 



ctcgactgtg 
gaccctggaa 
ttgtctgagt 
ggattgggaa 
ggaaagaacc 
attaagcgcg 
agcgcccgct 
tcaagctcta 
ccccaaaaaa 
ttttcgccct 
aacaacactc 
ggcctattgg 
aatgtgtgtc 
aagcatgcat 
cagaagtatg 
gcccatcccg 
tttttttatt 
aggaggcttt 
tttcggatct 
taatacgaca 
accgcgcgcg 
gacttcgtgg 
gcggtccagg 
gacgagctgt 
ccggccatga 
gccggcaact 
gattccaccg 
tggatgatcc 
attgcagctt 
tttttttcac 
tgtataccgt 
tgaaattgtt 
gcctggggtg 
ttccagtcgg 
ggcggtttgc 
gttcggctgc 
tcaggggata 
aaaaaggccg 
aatcgacgct 
ccccctggaa 
tccgcctttc 
agttcggtgt 
gaccgctgcg 
tcgccactgg 
acagagttct 
tgcgctctgc 
caaaccaccg 
aaaggatctc 
aactcacgtt 
ttaaattaaa 
agttaccaat 
atagttgcct 
cccagtgctg 
aaccagccag 
cagtctatta 
aacgttgttg 
ttcagctccg 
gcggttagct 
ctcatggtta 
tctgtgactg 
tgctcttgcc 
ctcatcattg 



ccttctagtt 
ggtgccactc 
aggtgtcatt 
gacaatagca 
snccntagct 
gcgggtgtgg 
cctttcgctt 
aatcggggca 
cttgattagg 
ttgacgttgg 
aaccctatct 
ttaaaaaatg 
agttagggtg 
ctcaattagt 
caaagcatgc 
cccctaactc 
tatgcagagg 
tttggaggcc 
gatcagcacg 
aggtgaggaa 
acgtcgccgg 
aggacgactt 
accaggtggt 
acgccgagtg 
ccgagatcgg 
gcgtgcactt 
ccgccttcta 
tccagcgcgg 
ataatggtta 
tgcattctag 
cgacctctag 
atccgctcac 
cctaatgagt 
gaaacctgtc 
gtattgggcg 
ggcgagcggt 
acgcaggaaa 
cgttgctggc 
caagtcagag 
gctccctcgt 
tcccttcggg 
aggtcgttcg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
tgaagccagt 
ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
gcttaatcag 
gactccccgt 
caatgatacc 
ccggaagggc 
attgttgccg 
ccattgctac 
gttcccaacg 
ccttcggtcc 
tggcagcact 
gtgagtactc 
cggcgtcaat 
gaaaacgttc 



gccagccatc 
ccactgtcct 
ctattctggg 
ggcatgctgg 
ggggctctag 
tggttacgcg 
tcttcccttc 
tccctttagg 
gtgatggttc 
agtccacgtt 
cggtctattc 
agctgattta 
tggaaagtcc 
cagcaaccag 
atctcaatta 
cgcccagttc 
ccgaggccgc 
taggcttttg 
tgttgacaat 
ctaaaccatg 
agcggtcgag 
cgccggtgtg 
gccggacaac 
gtcggaggtc 
cgagcagccg 
cgtggccgag 
tgaaaggttg 
ggatctcatg 
caaataaagc 
ttgtggtttg 
ctagagcttg 
aattccacac 
gagctaactc 
gtgccagctg 
ctcttccgct 
atcagctcac 
gaacatgtga 
gtttttccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
ctccaagctg 
taactatcgt 
tggtaacagg 
gcctaactac 
taccttcgga 
tggttttttt 
tttgatcttt 
ggtcatgaga 
taaatcaatc 
tgaggcacct 
cgtgtagata 
gcgagaccca 
cgagcgcaga 
ggaagctaga 
aggcatcgtg 
atcaaggcga 
tccgatcgtt 
gcataattct 
aaccaagtca 
acgggataat 
ttcggggcga 



4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
618 0 
624 0 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
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aaactctcaa ggatcttacc 
aactgatctt cagcatcttt 
caaaatgccg caaaaaaggg 
ctttttcaat attattgaag 
gaatgtattt agaaaaataa 
cctgacgtc 



-42- 

gctgttgaga tccagttcga 
tactttcacc agcgtttctg 
aataagggcg acacggaaat 
catttatcag ggttattgtc 
acaaataggg gttccgcgca 



tgtaacccac tcgtgcaccc 774 0 
ggtgagcaaa aacaggaagg 7800 
gttgaatact catactcttc 7860 
tcatgagcgg atacatattt 7920 
catttccccg aaaagtgcca 7980 

7989 



<210> 48 
<211> 7607 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmxd 

<400> 48 ^ ^ ^ ^« 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60 
ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 120 
gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 180 
agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 240 
qcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat ^gatcctgag 300 
aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 360 
atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 420 
tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480 
ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540 
ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 600 
tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660 
acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 720 
ggaaatattc ttattggtag aaacaactac atcctggtca tcatcctgcc tttctcttta 780 
tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 840 
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 900 
gttattgtgc tgtctcatca ttttggcaaa gaattagatc taagcttctg cagctcgagg 960 
actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 1020 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 1080 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga cggcccccga 1140 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 1200 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 1260 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 1320 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 1380 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag caccccgggc acggttgcag 1440 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 1500 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 1620 
ttttgtattg"tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1680 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 1740 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 1920 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1980 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2040 
ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 2100 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 2220 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 2340 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 2400 
gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2460 
ggggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 2520 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2580 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cccgagagcc 2640 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 2700 
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gcattttgac aattacagag gatgggcagg 
cttgtgaggc tacagaggag gctaggaatc 
ctgagtgtat tacttttcaa cagatcaagg 
cgcagaagta ttccatagag cagctgacca 
aggaggctat tagggtatat gcaaaggtgg 
gcaaacttgt aaatatcagg aattgttgct 
tagatacgga ggatagggtg gcctttagat 
ttggcatgga cggggtggtt attatgaatg 
cggttttcct ggccaatacc aaccttatcc 
atacctgtgt ggaagcctgg accgatgtaa 
ggaagggggt ggtgtgtcgc cccaaaagca 
ggtgtacctt gggtatcctg tctgagggta 
actgtggttg cttcatgcta gtgaaaagcg 
gcaactgcga ggacagggcc tctcagatgc 
tgaagaccat tcacgtagcc agccactctc 
tactgacccg ctgttccttg catttgggta 
gcaatttgag tcacactaag atattgcttg 
acggggtgtt tgacatgacc atgaagatct 
ccaggtgcag accctgcgag tgtggcggta 
atgtgaccga ggagctgagg cccgatcact 
gctctagcga tgaagataca gattgaggta 
gaaagaatat ataaggtggg ggtcttatgt 
ccgccatgag caccaactcg tttgatggaa 
tgcccccatg ggccggggtg cgtcagaatg 
tcctgcccgc aaactctact accttgacct 
ctgcagcctc cgccgccgct tcagccgctg 
ttgctttcct gagcccgctt gcaagcagtg 
agttgacggc tcttttggca caattggatt 
agcagctgtt ggatctgcgc cagcaggttt 
cggtttaaaa cataaataaa aaaccagact 
gctgtctcag ctgactgctt aagtcgcaag 
ttaaagcaga acttgtttat tgcagcttat 
aatttcacaa ataaagcatt tttttcactg 
aatgtatctt atcatgtctg gtcgactcta 
ctgcgctcgg tcgttcggct gcggcgagcg 
ttatccacag aatcagggga taacgcagga 
gccaggaacc gtaaaaaggc cgcgttgctg 
gagcatcaca aaaatcgacg ctcaagtcag 
taccaggcgt ttccccctgg aagctccctc 
accggatacc tgtccgcctt tctcccttcg 
tgtaggtatc tcagttcggt gtaggtcgtt 
cccgttcagc ccgaccgctg cgccttatcc 
agacacgact tatcgccact ggcagcagcc 
gtaggcggtg ctacagagtt cttgaagtgg 
gtatttggta tctgcgctct gctgaagcca 
tgatccggca aacaaaccac cgctggtagc 
acgcgcagaa aaaaaggatc tcaagaagat 
cagtggaacg aaaactcacg ttaagggatt 
acctagatcc ttttaaatta aaaatgaagt 
acttggtctg acagttacca atgcttaatc 
tttcgttcat ccatagttgc ctgactcccc 
ttaccatctg gccccagtgc tgcaatgata 
ttatcagcaa taaaccagcc agccggaagg 
tccgcctcca tccagtctat taattgttgc 
aatagtttgc gcaacgttgt tgccattgct 
ggtatggctt cattcagctc cggttcccaa 
ttgtgcaaaa aagcggttag ctccttcggt 
gcagtgttat cactcatggt tatggcagca 
gtaagatgct tttctgtgac tggtgagtac 
cggcgaccga gttgctcttg cccggcgtca 
actttaaaag tgctcatcat tggaaaacgt 
ccgctgttga gatccagttc gatgtaaccc 



-43- 

ggctaaaggg ggtaaagagg gagcgggggg 2760 
tagcttttag cttaatgacc agacaccgtc 2820 
ataattgcgc taatgagctt gatctgctgg 2880 
cttactggct gcagccaggg gatgattttg 2940 
cacttaggcc agattgcaag tacaagatca 3000 
acatttctgg gaacggggcc gaggtggaga 3060 
gtagcatgat aaatatgtgg ccgggggtgc 3120 
taaggtttac tggccccaat tttagcggta 3180 
tacacggtgt aagcttctat gggtttaaca 3240 
gggttcgggg ctgtgccttt tactgctgct 3300 
gggcttcaat taagaaatgc ctctttgaaa 3360 
actccagggt gcgccacaat gtggcctccg 3420 
tggctgtgat taagcataac atggtatgtg 3480 
tgacctgctc ggacggcaac tgtcacctgc 3540 
gcaaggcctg gccagtgttt gagcataaca 3600 
acaggagggg ggtgttccta ccttaccaat 3660 
agcccgagag catgtccaag gtgaacctga .3720 
ggaaggtgct gaggtacgat gagacccgca 3780 
aacatattag gaaccagcct gtgatgctgg 3840 
tggtgctggc ctgcacccgc gctgagtttg 3900 
ctgaaatgtg tgggcgtggc ttaagggtgg 3960 
agttttgtat ctgttttgca <^cagccgccg 4020 
gcattgtgag ctcatatttg acaacgcgca 4080 
tgatgggctc cagcattgat ggtcgccccg 4140 
acgagaccgt gtctggaacg ccgttggaga 4200 
cagccaccgc ccgcgggatt gtgactgact 42 60 
cagcttcccg ttcatccgcc cgcgatgaca 4320 
ctttgacccg ggaacttaat gtcgtttctc 4380 
ctgccctgaa ggcttcctcc cctcccaatg 44 40 
ctgtttggat ttggatcaag caagtgtctt 4500 
ccgaattgga tccaattcgg atcgatctta 4560 
aatggttaca aataaagcaa tagcatcaca 4620 
cattctagtt gtggtttgtc " caaactcatc 4680 
gactcttccg cttcctcgct cactgactcg 4740 
gtatcagctc actcaaaggc ggtaatacgg 4800 
aagaacatgt gagcaaaagg ccagcaaaag 4860 
gcgtttttcc ataggctccg cccccctgac 4920 
aggtggcgaa acccgacagg actataaaga 4 980 
gtgcgctctc ctgttccgac cctgccgctt 5040 
ggaagcgtgg cgctttctca tagctcacgc 5100 
cgctccaagc tgggctgtgt gcacgaaccc 5160 
ggtaactatc gtcttgagtc caacccggta 5220 
actggtaaca ggattagcag agcgaggtat 5280 
tggcctaact acggctacac tagaaggaca 5340 
gttaccttcg gaaaaagagt tggtagctct 5400 
ggtggttttt ttgtttgcaa gcagcagatt 54 60 
cctttgatct tttctacggg gtctgacgct 5520 
ttggtcatga gattatcaaa aaggatcttc 5580 
tttaaatcaa tctaaagtat atatgagtaa 5640 
agtgaggcac ctatctcagc gatctgtcta 5700 
gtcgtgtaga taactacgat acgggagggc 57 60 
ccgcgagacc cacgctcacc ggctccagat 5820 
gccgagcgca gaagtggtcc tgcaacttta 5880 
cgggaagcta gagtaagtag ttcgccagtt 594 0 
acaggcatcg tggtgtcacg ctcgtcgttt 6000 
cgatcaaggc gagttacatg atcccccatg 6060 
cctccgatcg ttgtcagaag taagttggcc 6120 
ctgcataatt ctcttactgt catgccatcc 6180 
tcaaccaagt cattctgaga atagtgtatg 6240 
atacgggata ataccgcgcc acatagcaga 6300 
tcttcggggc gaaaactctc aaggatctta 6360 
actcgtgcac ccaactgatc ttcagcatct 6420 
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tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 64 80 
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 6540 
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 6600 
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc 6660 
attattatca tgacattaac ctataaaaat aggcgtatca cgaggcccct ttcgtctcgc 6720 
gcgtttcggt gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc 6780 
ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg 6840 
cgggtgtcgg ggctggctta actatgcggc atcagagcag attgtactga gagtgcacca 6900 
tatgcggtgt gaaataccgc acagatgcgt aaggagaaaa taccgcatca ggaaattgta 6960 
agcgttaata ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac 7020 
caataggccg aaatcggcaa aatcccttat aaatcaaaag aatagaccga gatagggttg 7 08O 
agtgttgttc cagtttggaa caagagtcca ctattaaaga acgtggactc caacgtcaaa 7140 
gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg aaccatcacc ctaatcaagt 7200 
tttttggggt cgaggtgccg taaagcacta aatcggaacc ctaaagggag cccccgattt 72 60 
agagcttgac* ggggaaagcc ggcgaacgtg gcgagaaagg aagggaagaa agcgaaagga 7 320 
gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc gcgtaaccac cacacccgcc 7380 
gcgcttaatg cgccgctaca gggcgcgtcc cattcgccat tcaggctgcg caactgttgg 7440 
gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 7500 
gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg 7560 
gccagtgaat tgtaatacga ctcactatag ggcgaattaa ttcgggg 7 607 



<210> 49 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 
<400> 49 

gaattccgca ttgcagagat attgtattta agtgcctagc tcgatacaat aaacgccatt 60 
tgaccattca ccacattggt gtgcacctcc aagcttgggc agaaatggtt gaactcccga 120 
gagtgtccta cacctagggg agaagcagcc aaggggttgt ttcccaccaa ggacgacccg 180 
tctgcgcaca aacggatgag cccatcagac aaagacatat tcattctctg ctgcaaactt 240 
ggcatagctc tgctttgcct- ggggctattg ggggaagttg cggttcgtgc tcgcagggct 300 
ctcacccttg actcttttaa tagctcttct gtgcaagatt acaatctaaa caattcggag 360 
aactcgacct tcctcctgag gcaaggacca cagccaactt cctcttacaa gccgcatcga 4 20 
ttttgtcctt cagaaataga aataagaatg cttgctaaaa attatatttt taccaataag 480 
accaatccaa taggtagatt attagttact atgttaagaa atgaatcatt atcttttagt 540 
actattttta ctcaaattca gaagttagaa atgggaatag aaaatagaaa gagacgctca 600 
acctcaattg aagaacaggt gcaaggacta ttgaccacag gcctagaagt aaaaaaggga 660 
aaaaagagtg tttttgtcaa aataggagac aggtggtggc aaccagggac ttatagggga 720 
ccttacatct acagaccaac agatgccccc ttaccatata caggaagata tgacttaaat 780 
tgggataggt gggttacagt caatggctat aaagtgttat atagatccct cccttttcgt 840 
gaaagactcg ccagagctag acctccttgg tgtatgttgt ctcaagaaga aaaagacgac 900 
atgaaacaac aggtacatga ttatatttat ctaggaacag gaatgcactt ttggggaaag 960 
attttccata ccaaggaggg gacagtggct ggactaatag aacattattc tgcaaaaact 1020 
catggcatga gttattatga atagccttta ttggcccaac cttgcggttc ccagggctta 1080 
agtaagtttt tggttacaaa ctgttcttaa aacgaggatg tgagacaagt ggtttcctga 114 0 
cttggtttgg tatcaaaggt tctgatctga gctctgagtg ttctattttc ctatgttctt 1200 
ttggaattta tccaaatctt atgtaaatgc ttatgtaaac caagatataa aagagtgctg 1260 
attttttgag taaacttgca acagtcctaa cattcacctc ttgtgtgttt gtgtctgttc 1320 
gccatcccgt ctccgctcgt cacttatcct tcactttcca gagggtcccc ccgcagaccc 1380 
cggcgaccct caggtcggcc gactgcggca gctggcgccc gaacagggac cctcggataa 144 0 
gtgacccttg tctctatttc tactatttgg tgtttgtctt gtattgtctc tttcttgtct 1500 
ggctatcatc acaagagcgg aacggactca ccatagggac caagctagcg cttctcgtcg 1560 
cgtccaagac cctcaaagat ttttggcact tcgttgagcg aggcgatatc aggtatgaca 1620 
gcgccctgcc gcaaggccag ctgcttgtcc gctcggctgc ggttggcacg gcaggatagg 1680 
ggtatcttgc agttttggaa aaagatgtga taggtggcaa gcacctctgg cacggcaaat 1740 
acggggtaga agttgaggcg cgggttgggc tcgcatgtgc cgttttcttg gcgtttgggg 1800 
ggtacgcgcg gtgagaatag gtggcgttcg taggcaaggc tgacatccgc tatggcgagg 1860 
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ggcacatcgc tgcgctcttg caacgcgtcg 
ttcaacagca cgtcgtctcc cacatctagg 
acttgttcct cgtttgcctc tgcgttgtcc 
gagcggtcct cgtcgtcttc gcttacaaaa 
tcctcaagcg ggggtgcctc gacggggaag 
gcggtggtgg cgaactcaga gggggcggtt 
atctttttct gcctatagga gaaggaaatg 
acccccgagc gcggacgcgg tgcggcgcga 
ccgtccccgt cgccgccgcc tccccgggcg 
gagtccgagg acgaggaaga ctcatcacaa 
cggccatcga cctcggcggc ggatttggcc 
tctcccaagc ccgagcgccc gccatcacca 
gaagatgtgg cgctacaaat ggtgggtttc 
aaaggaggta agcgcacagt gcggcggctg 
cggacgcaag aggaagagga agagcccagc 
ccgctgagtg tgccgatcgt gtctgcgtgg 
atggacaagt accacgtgga taacgatcta 
gtggaagctc tggcggccgt atgcaagacc 
ctgaccttca ccagcaacaa gacctttgtg 
ctgcagtcgt ttgcagaggt gacctacaag 
ctgcaccgct gcgctgagat cgaaggcgag 
aataaggagc acgtgattga aatggatgtg 
gagcagtcta gcaaggccaa gatcgtgaag 
tccaacaccg acgcaaggtg ctgcgtgcac 
ggcaagtctt gcggcatgtt cttctctgaa 
atcaaggctt ttatgcaggc gctgtatcct 
atgccactac ggtgcgagtg caactcaaag 
ctaccaaagt tgactccgtt cgccctgagc 
tccgacaaga gcgtgctggc cagcgtgcac 
aaccctgtgt atcgcaactc gcgcgcgcag 
tcggcgcccg acctgctaaa cgcgttggtg 
accgagctgc cgcggatggt tgtgcctgag 
aacgtgtccc tgccagtggc gcatagcgat 
gcgcagacgg caagggtggg ggtaaataat 
cctttattga aagtgtctct agtacattat 
gtggcgctcc taatctgcgc actgtggctg 
ctgtagagct gttcctggtt gcgacgcagg 
gagttgggta ccccggtaat aaggttcatg 
cagttggcaa aggcgtggag aaacatgcag 
ccctgtacgc tttgggtgga cttttccagc 
gcgctacggc gcaggagtga ctcgtactca 
gaaaagccaa agggctcaaa gaggtagcat 
atccagtgta cgcccccagt ctcgcgacqg 
tgtggagaaa caaagcctgg aaagcgcttg 
ccaagatctt tgacaatggc tttcagttcc 
gttgatgttg cttgcttctt tatgttgtgg 
acacggtttc gatgacgccg cggtgcggcc 
acaaaacata aagaagggtg ggctcgtcca 
ttacctcagg tcgacctcga gggatctttg 
aattggacaa actacctaca gagatttaaa 
tataatgtgt taaactactg attctaattg 
ctgatgaatg ggagcagtgg tggaatgcct 
aaatgccatc tagtgatgat gaggctactg 
agaagagaaa ggtagaagac cccaaggact 
atgctgtgtt tagtaataga actcttgctt 
ctgcactgct atacaagaaa attatggaaa 
acagttataa tcataacata ctgttttttc 
ttaataacta tgctcaaaaa ttgtgtacct 
aggaatattt gatgtatagt gccttgacta 
gaggttttac ttgctttaaa aaacctccca 
aatgcaattg ttgttgttaa cttgtttatt 
agcatcacaa atttcacaaa taaagcattt 



•45' 

cagataatgg cgcactggcg ctgcagatgc 1920 
tagtcgccat gcctttcgtc cccccgcccg 1980 
tggtcttgct ttttatcctc tgttggtact 2040 
cctgggtcct gctcgataat cacttcctcc 2100 
gtggtaggcg cgttggcggc atcggtggag 2160 
aggctgtcct tcttctcgac tgactccatg 2220 
gccagtcggg aagaggagca gcgcgaaacc 2280 
cgtcccccaa ccatggagga cgtgtcgtcc 2340 
cccccaaaaa agcggatgag gcggcgtatc 2400 
gacgcgctgg tgccgcgcac acccagcccg 2460 
attgcgccca agaagaaaaa gaagcgccct 2520 
gaggtaatcg tggacagcga ggaagaaaga 2580 
agcaacccac cggtgctaat caagcatggc 2640 
aatgaagacg acccagtggc gcgtggtatg 2700 
gaagcggaaa gtgaaattac ggtgatgaac 2760 
gagaagggca tggaggctgc gcgcgcgctg 2820 
aaggcgaact tcaaactact gcctgaccaa 2880 
tggctgaacg aggagcaccg cgggttgcag 2940 
acgatgatgg ggcgattcct gcaggcgtac 3000- 
catcacgagc ccacgggctg cgcgttgtgg 3060 
cttaagtgtc tacacggaag cattatgata 3120 
acgagcgaaa acgggcagcg cgcgctgaag 3180 
aaccggtggg gccgaaatgt ggtgcagatc 3240 
gacgcggcct gtccggccaa tcagttttcc 3300 
ggcgcaaagg ctcaggtggc ttttaagcag 3360 
aacgcccaga ccgggcacgg tcaccttttg 3420 
cctgggcacg cgcccttttt gggaaggcag 34 80 
aacgcggagg acctggacgc ggatctgatc 354 0 
cacccggcgc tgatagtgtt ccagtgctgc 3600 
ggcggaggcc ccaactgcga cttcaagata 3660 
atggtgcgca gcctgtggag tgaaaacttc 3720 
tttaagtgga gcactaaaca ccagtatcgc 3780 
gcgcggcaga acccctttga tttttaaacg 384 0 
cacccgagag tgtacaaata aaagcatttg 3900 
ttttacatgt ttttcaagtg acaaaaagaa 3960 
cggaagtagg gcgagtggcg ctccaggaag 4020 
gtgggctgta cctggggact gttgagcatg 4080 
gtggggttgt gatccatggg agtttggggc 4140 
cagaatagtc cacaggcggc cgagttgggc 4200 
gttatacagc ggtcggggga agaagcaatg 4 260 
aactggtaaa cctgcttgag tcgctggtca 4320 
gtttttgagt gcgggttcca ggcaaaggcc 4380 
gccgtattga ctatggcgca ggcgagcttg 4440 
tcataggtgc ccaaaaaata tggcccacaa 4500 
tgctcactgg agcccatggc ggcagctgtt 4560 
cgttgccggc cgagaagggc gtgcgcaggt 4620 
ggtgcacacg gaccacgtca aagacttcaa 4 680 
tgggatccat atatagggcc cgggttataa 474 0 
tgaaggaacc ttacttctgt ggtgtgacat 4800 
gctctaaggt aaatataaaa tttttaagtg 4860 
tttgtgtatt ttagattcca acctatggaa 4920 
ttaatgagga aaacctgttt tgctcagaag 4980 
ctgactctca acattctact cctccaaaaa 504 0 
ttccttcaga attgctaagt tttttgagtc 5100 
gctttgctat ttacaccaca aaggaaaaag 5160 
aatattctgt aacctttata agtaggcata 5220^ 
ttactccaca caggcataga gtgtctgcta 5280 
ttagcttttt aatttgtaaa ggggttaata 5340 
gagatcataa tcagccatac cacatttgta 5400 
cacctccccc tgaacctgaa acataaaatg 5460 
gcagcttata atggttacaa ataaagcaat 5520 
ttttcactgc attctagttg tggtttgtcc 5580 
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aaactcatca 
tgtggaaagt 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
ggaagcggaa 
gctactgggc 
gtgggcttac 
tgccagctgg 
tcttgccgcc 
gatcgtttcg 
agaggctatt 
tccggctgtc 
tgaatgaact 
gcgcagctgt 
tgccggggca 
ctgatgcaat 
cgaaacatcg 
atctggacga 
gcatgcccga 
tggtggaaaa 
gctatcagga 
ctgaccgctt 
atcgccttct 
gacgcccaac 
cttcggaatc 
ggagttcttc 
gctggacgac 
cagcggctat 
tttggtcccg 
tacctacaga 
aactactgat 
agcagtggtg 
gtgatgatga 
tagaagaccc 
gtaatagaac 
acaagaaaat 
ataacatact 
ctcaaaaatt 
tgtatagtgc 
gctttaaaaa 
gttgttaact 
ttcacaaata 
gtatcttatc 
ctctacttga 
aattaggtca 
ggtaatttta 
cagcccacaa 
accctgctca 
attttcccca 
aacccagcac 
catctgctga 
cctgtagttt 
aaaatttgac 
atgcaagttt 
catcaaaata 
cgaaagggcc 
tagacgtcag 
taaatacatt 
tattgaaaaa 



atgtatctta 
ccccaggctc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctcggcctc 
gcaaaaagct 
cacgtagaaa 
tatctggaca 
atggcgatag 
ggcgccctct 
aaggatctga 
catgattgaa 
cggctatgac 
agcgcagggg 
gcaggacgag 
gctcgacgtt 
ggatctcctg 
gcggcggctg 
catcgagcga 
agagcatcag 
cggcgaggat 
tggccgcttt 
catagcgttg 
cctcgtgctt 
tgaqgagttc 
ctgccatcac 
gttttccggg 
gcccaccccg 
ctcgcggagt 
ccgcgcatcc 
gatctttgtg 
gatttaaagc 
tctaattgtt 
gaatgccttt 
ggctactgct 
caaggacttt 
tcttgcttgc 
tatggaaaaa 
gttttttctt 
gtgtaccttt 
cttgactaga 
acctcccaca 
tgtttattgc 
aagcattttt 
atgtctggat 
gaggacattc 
cttaacaaaa 
aaatatctgg 
atgtcaacag 
tcaagaagca 
cctgtgtagg 
tccactggat 
ctgtcaactg 
gctaacacac 
ccttgaatgg 
aacatagcag 
tttccacagg 
tcgtgatacg 
gtggcacttt 
caaatatgta 
ggaagagtat 



tcatgtctgg 
cccagcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
tcacgctgcc 
gccagtccgc 
agggaaaacg 
ctagactggg 
ggtaaggttg 
tggcgcaggg 
caagatggat 
tgggcacaac 
cgcccggttc 
gcagcgcggc 
gtcactgaag 
tcatctcacc 
catacgcttg 
gcacgtactc 
gggctcgcgc 
ctcgtcgtga 
tctggattca 
gctacccgtg 
tacggtatcg 
ttctgagcgg 
gagatttcga 
acgccggctg 
ggctcgatcc 
tctaccggca 
atgcccccga 
aaggaacctt 
tctaaggtaa 
tgtgtatttt 
aatgaggaaa 
gactctcaac 
ccttcagaat 
tttgctattt 
tattctgtaa 
actccacaca 
agctttttaa 
gatcataatc 
cctccccctg 
agcttataat 
ttcactgcat 
ccccaggaag 
caatcatagg 
aggaaattgg 
gaagtccctt 
cagaaacata 
ctgtggttgc 
ttccaaaata 
aagcattatc 
tagcattttt 
cctgcagctc 
gttttccagc 
ttaccccaat 
ttaagtcctc 
cctattttta 
tcggggaaat 
tccgctcatg 
gagtattcaa 



atccggctgt 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
gcaagcactc 
agaaacggtg 
caagcgcaaa 
cggttttatg 
ggaagccctg 
gatcaagatc 
tgcacgcagg 
agacaatcgg 
tttttgtcaa 
tatcgtggct 
cgggaaggga 
ttgctcctgc 
atccggctac 
ggatggaagc 
cagccgaact 
cccatggcga 
tcgactgtgg 
atattgctga 
ccgctcccga 
gactctgggg 
ttccaccgcc 
gatgatcctc 
cctcgcgagt 
gtgcaaatcc 
actgcaggag 
acttctgtgg 
atataaaatt 
agattccaac 
acctgttttg 
attctactcc 
tgctaagttt 
acaccacaaa 
cctttataag 
ggcatagagt 
tttgtaaagg 
agccatacca 
aacctgaaac 
ggttacaaat 
tctagttgtg 
ctcctctgtg 
ctgcccatcc 
gtaggggttt 
ccactgctgt 
caagctgtca 
tgtgttagta 
tctagtgttt 
cttatccaaa 
tggggttaca 
caaaggttcc 
accattttca 
aacctcagtt 
atttaaatta 
taggttaatg 
gtgcgcggaa 
agacaataac 
catttccgtg 



ggaatgtgtg 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
agggcgcaag 
ctgaccccgg 
gagaaagcag 
gacagcaagc 
caaagtaaac 
tgatcaagag 
ttctccggcc 
ctgctctgat 
gaccgacctg 
ggccacgacg 
ctggctgcta 
cgagaaagta 
ctgcccattc 
cggtcttgtc 
gttcgccagg 
tgcctgcttg 
ccggctgggt 
agagcttggc 
ttcgcagcgc 
ttcgaaatga 
gccttctatg 
cagcgcgggg 
tggttcagct 
gtcggcatcc 
tggggaggca 
tgtgacataa 
tttaagtgta 
ctatggaact 
ctcagaagaa 
tccaaaaaag 
tttgagtcat 
ggaaaaagct 
taggcataac 
gtctgctatt 
ggttaataag 
catttgtaga 
ataaaatgaa 
aaagcaatag 
gtttgtccaa 
tcctcataaa 
accctctgtg 
ttcacagacc 
gttccagaag 
gctttgcaca 
atgtgcaaaa 
tcatttttac 
acagccttgt 
gtttgagcag 
ccaccaacag 
tgagtttttt 
ttaacagtaa 
ggcaaaggaa 
tcatgataat 
cccctatttg 
cctgataaat 
tcgcccttat 



tcagttaggg 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
ggctgctaaa 
atgaatgtca 
gtagcttgca 
gaaccggaat 
tggatggctt 
acaggatgag 
gcttgggtgg 
gccgccgtgt 
tccggtgccc 
ggcgttcctt 
ttgggcgaag 
tccatcatgg 
gaccaccaag 
gatcaggatg 
ctcaaggcgc 
ccgaatatca 
^tggcggacc 
ggcgaatggg 
atcgccttct 
ccgaccaagc 
aaaggttggg 
atctcatgct 
gctgcctgag 
aggaaaccag 
cgatggccgc 
ttggacaaac 
taatgtgtta 
gatgaatggg 
atgccatcta 
aagagaaagg 
gctgtgttta 
gcactgctat 
agttataatc 
aataactatg 
gaatatttga 
ggttttactt 
tgcaattgtt 
catcacaaat 
actcatcaat 
ccctaacctc 
tcctcctgtt 
gctttctaag 
tgttggtaaa 
agggcccaac 
caggaggcac 
ttggatcagg 
ggtcagtgtt 
gatatttggt 
caaaaaaatg 
gtgtccctga 
cagcttccca 
ttcttgaaga 
aatggtttct 
tttatttttc 
gcttcaataa 
tccctttttt 



5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
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gcggcatttt gccttcctgt ttttgctcac 
gaagatcagt tgggtgcacg agtgggttac 
cttgagagtt ttcgccccga agaacgtttt 
tgtggcgcgg tattatcccg tgttgacgcc 
tattctcaga atgacttggt tgagtactca 
atgacagtaa gagaattatg cagtgctgcc 
ttacttctga caacgatcgg aggaccgaag 
gatcatgtaa ctcgccttga tcgttgggaa 
gagcgtgaca ccacgatgcc tgcagcaatg 
gaactactta ctctagcttc ccggcaacaa 
gcaggaccac ttctgcgctc ggcccttccg 
gccggtgagc gtgggtctcg cggtatcatt 
cgtatcgtag ttatctacac gacggggagt 
atcgctgaga taggtgcctc actgattaag 
tatatacttt agattgattt aaaacttcat 
ctttttgata atctcatgac caaaatccct 
gaccccgtag aaaagatcaa aggatcttct 
tgcttgcaaa caaaaaaacc accgctacca 
ccaactcttt ttccgaaggt aactggcttc 
ctagtgtagc cgtagttagg ccaccacttc 
gctctgctaa tcctgttacc agtggctgct 
ttggactcaa gacgatagtt accggataag 
tgcacacagc ccagcttgga gcgaacgacc 
ctatgagaaa gcgccacgct tcccgaaggg 
agggtcggaa caggagagcg cacgagggag 
agtcctgtcg. ggtttcgcca cctctgactt 
gggcggagcc tatggaaaaa cgccagcaac 
tggccttttg ctcacatgtt ctttcctgcg 
accgcctttg agtgagctga taccgctcgc 
gtgagcgagg aagcggaaga gcgcctgatg 
atttcacacc gcatatggtg cactctcagt 
cagtatctgc tccctgcttg tgtgttggag 
gctacaacaa ggcaaggctt gaccgacaat 
tttgcgctgc ttcgcgatgt acgggccaga 
tttaggcgaa aagcggggct tcggttgtac 
tttcgctttt gcatagggag ggggaaatgt 
acatggtaac gatgagttag caacatgcct 
gattggtgga agtaaggtgg tacgatcgtg 
catggattgg acgaaccact 



ccagaaacgc tggtgaaagt aaaagatgct 9360 
atcgaactgg atctcaacag cggtaagatc 9420 
ccaatgatga gcacttttaa agttctgcta 9480 
gggcaagagc aactcggtcg ccgcatacac 9540 
ccagtcacag aaaagcatct tacggatggc 9600 
ataaccatga gtgataacac tgcggccaac 9660 
gagctaaccg cttttttgca caacatgggg 9720 
ccggagctga atgaagccat accaaacgac 9780 
gcaacaacgt tgcgcaaact attaactggc 9840 
ttaatagact ggatggaggc ggataaagtt 9900 
gctggctggt ttattgctga taaatctgga 9960 
gcagcactgg ggccagatgg taagccctcc 10020 
caggcaacta tggatgaacg aaatagacag 1008 0 
cattggtaac tgtcagacca agtttactca 1014 0 
ttttaattta aaaggatcta ggtgaagatc 10200 
taacgtgagt tttcgttcca ctgagcgtca 10260 
tgagatcctt tttttctgcg cgtaatctgc 10320 
gcggtggttt gtttgccgga tcaagagcta 1038 0 
agcagagcgc agataccaaa tactgtcctt 1044 0 
aagaactctg tagcaccgcc tacatacctc 10500 
gccagtggcg ataagtcgtg tcttaccggg 10560 
gcgcagcggt cgggctgaac ^gggggttcg 10620 
tacaccgaac tgagatacct acagcgtgag 1068 0 
agaaaggcgg acaggtatcc ggtaagcggc 1074 0 
cttccagggg gaaacgcctg gtatctttat 10800 
gagcgtcgat ttttgtgatg ctcgtcaggg 10860 
gcggcctttt tacggttcct ggccttttgc 10920 
ttatcccctg attctgtgga taaccgtatt 10980 
cgcagccgaa cgaccgagcg cagcgagtca 1104 0 
cggtattttc tccttacgca tctgtgcggt 11100 
acaatctgct ctgatgccgc atagttaagc 11160 
gtcgctgagt agtgcgcgag caaaatttaa 11220 
tgcatgaaga atctgcttag ggttaggcgt 11280 
tatacgcgta tctgagggga ctagggtgtg 1134 0 
gcggttagga gtcccctcag gatatagtag 11400 
agtcttatgc aatacacttg tagtcttgca 11460 
tacaaggaga gaaaaagcac cgtgcatgcc 11520 
ccttattagg aaggcaacag acgggtctga 11580 

11600 



<220> 

<223> Description of Artificial Sequence: plasiaid 
<400> 49 



<210> 50 
<211> 8238 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: piasmid 

gjggccgcca tcatcaataa tataccttat tttggattga agccaatatg ataatgaggg 60 
ggtggagttt gtgacgtggc gcggggcgtg ggaacggggc gggtgacgta ^^^^^gtggc 120 
ggaagtgtga igttgcaagt gtggcggaac acatgtaagc gacggatgtg gcaaaagtga 180 
cgtttttggt g?gcgccggt gtacacagga agtgacaatt ttcgcgcggt tttaggcgga 24 0 
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tgttgtagta 

taagaggaag 

gccgcgggga 

ccagatatac 

tgtacgcggt 

aatgtagtct 

tgccttacaa 

tcgtgcctta 

ccgcattgca 

attcaccaca 

gcgggggcca 

cttggatcgg 

tccgcatcga 

tagaactagt 

gcacctcttt 

cgcgtctgcg 

gcgccctcgc 

actgcgggtt 

ggtggccctg 

ggtgctgggg 

gggtgagata 

catgccttat 

ttcacatgcc 

cctgtgctac 

gttcgtggcc 

ggaggacaga 

ggctatgctg 

gcagggcggc 

ccagggtgcc 

taccctgttt 

ctgggccttg 

cgaccaatcg 

gacccacgtc 

ccgggagatg. 

ataccgtcga 

aaatttcaca 

caatgtatct 

gacccgcacc 

gatgctggat 

tgagtttggc 

aagggtggga 

agccgccgcc 

aacgcgcatg 

tcgccccgtc 

gttggagact 

gactgacttt 

cgatgacaag 

cgtttctcag 

tcccaatgcg 

agtgtcttgc 

tcggtcgttg 

cagatacatg 

ctgcggggtg 

aatgtctttc 

gcggttaagc 

taggttggct 

cacagtgtat 

gaacttggag 

aatgggccca 

gtgttccagg 

ctgcggtata 

ccacgctttg 



aatttgggcg 
tgaaatctga 
ctttgaccgt 
gcgtatctga 
taggagtccc 
tatgcaatac 
ggagagaaaa 
ttaggaaggc 
gagatattgt 
ttggtgtgca 
gctgttgggc 
aaacccgtcg 
ccggatcgga 
ggatcccccg 
ggcaagcgcc 
ttcgaccagg 
cggcagcaag 
tatatagacg 
ggttcgcgcg 
gcttccgaga 
tcggccgggg 
gccgtgaccg 
ccgcccccgg 
ccggccgcgc 
ctcatcccgc 
cacatcgacc 
gccgcgat tc 
gggtcgtggt 
gagccccaga 
cgggcccccg 
gacgtcttgg 
cccgccggct 
accaccccag 
ggggaggcta 
aacttgttta 
aataaagcat 
tatcatgtct 
aggtgcagac 
gtgaccgagg 
tctagcgatg 
aagaatatat 
gccatgagca 
cccccatggg 
ctgcccgcaa 
gcagcctccg 
gctttcctga 
ttgacggctc 
cagctgttgg 
gtttaaaaca 
tgtctttatt 
agggtcctgt 
ggcataagcc 
gtgttgtaga 
agtagcaagc 
tgggatgggt 
atgttcccag 
ccggtgcact 
acgcccttgt 
cgggcggcgg 
atgagatcgt 
atggttccat 
agttcagatg 



taaccgagta 
ataattttgt 
ttacgtggag 
ggggactagg 
ctcaggatat 
tcttgtagtc 
agcaccgtgc 
aacagacggg 
atttaagtgc 
cctccggccc 
tcgcggttga 
gcctccgaac 
aaacctctcg 
ggctgcagga 
ttgtagaagc 
ctgcgcgttc 
aagccacgga 
gtcctcacgg 
acgatatcgt 
caatcgcgaa 
acgcggcggt 
acgccgt tct 
ccctcaccct 
gataccttat 
cgaccttgcc 
gcctggccaa 
gccgcgttta 
gggaggattg 
gcaacgcggg 
agttgctggc 
ccaaacgcct 
gccgggacgc 
gctccatacc 
actgactcga 
ttgcagctta 
ttttttcact 
ggatccgacc 
cctgcgagtg 
agctgaggcc 
aagatacaga 
aaggtggggg 
ccaactcgtt 
ccggggtgcg 
actctactac 
ccgccgcttc 
gcccgcttgc 
ttttggcaca 
atctgcgcca 
taaataaaaa 
taggggtttt 
gtattttttc 
cgtctctggg 
tgatccagtc 
tgattgccag 
gcatacgtgg 
ccatatccct 
tgggaaattt 
gacctccaag 
cctgggcgaa 
cataggccat 
ccggcccagg 
gggggatcat 



agatttggcc 
gttactcata 
actcgcccag 
gtgtgtttag 
agtagtttcg 
ttgcaacatg 
atgccgattg 
tctgacatgg 
ctagctcgat 
atatggccac 
ggacaaactc 
ggtactccgc 
agaaaggcgt 
attcgatgat 
gcgtatggct 
tcgcggccat 
agtccgcctg 
gatggggaaa 
ctacgtaccc 
catctacacc 
ggtaatgaca 
ggctcctcat 
catcttcgac 
gggcagcatg 
cggcacaaac 
acgccagcgc 
cgggctgctt 
gggacagctt 
cccacgaccc 
ccccaacggc 
ccgtcccatg 
cctgctgcaa 
gacgatctgc 
gaagcttggg 
taatggttac 
gcattctagt 
tcggatctgg 
tggcggtaaa 
cgatcacttg 
ttgaggtact 
tcttatgtag 
tgatggaagc 
tcagaatgtg 
cttgacctac 
agccgctgca 
aagcagtgca 
attggattct 
gcaggtttct 
accagactct 
gcgcgcgcgg 
caggacgtgg 
gtggaggtag 
gtagcaggag 
gggcaggccc 
ggatatgaga 
ccggggattc 
gtcatgtagc 
attttccatg 
gatatttctg 
ttttacaaag 
ggcgtagtta 
gtctacctgc 



attttcgcgg 
gcgcgtaata 
ggcgcgcccc 
gcgaaaagcg 
cttttgcata 
gtaacgatga 
gtggaagtaa 
attggacgaa 
acaataaacg 
tctcttccgc 
ttcgcggtct 
cgccgaggga 
gtaaccagtc 
cttggtggcg 
tcgtacccct 
agcaaccgac 
gagcagaaaa 
accaccacca 
gagccgatga 
acacaacacc 
agcgcccaga 
gtcggggggg 
cgccatccca 
accccccagg 
atcgtgttgg 
cccggcgagc 
gccaatacgg 
tcggggacgg 
catatcgggg 
gacctgtata 
cacgtcttta 
cttacctccg 
gacctggcgc 
cccatcgatc 
aaataaagca 
tgtggtttgt 
aaggtgctga 
catattagga 
gtgctggcct 
gaaatgtgtg 
ttttgtatct 
attgtgagct 
atgggctcca 
gagaccgtgt 
gccaccgccc 
gcttcccgtt 
ttgacccggg 
gccctgaagg 
gtttggattt 
taggcccggg 
taaaggtgac 
caccactgca 
cgctgggcgt 
ttggtgtaag 
tgcatcttgg 
atgttgtgca 
ttagaaggaa 
cattcgtcca 
ggatcactaa 
cgcgggcgga 
ccctcacaga 
ggg^cgatga 



gaaaactgaa 
tttgtctagg 
gatgtacggg 
gggcttcggt 
gggaggggga 
gttagcaaca 
ggtggtacga 
ccactgaatt 
ccatttgacc 
atcgctgtct 
ttccagtact 
cctgagcgag 
acagtcgctc 
tgaaactccc 
gccatcaaca 
gtacggcgtt 
tgcccacgct 
cgcaactgct 
cttactggca 
gcctcgacca 
taacaatggg 
aggctgggag 
tcgccgccct 
ccgtgctggc 
gggcccttcc 
ggcttgacct 
tgcggtatct' 
ccgtgccgcc 
acacgttatt 
acgtgtttgc 
tcctggatta 
ggatggtcca 
gcacgtttgc 
aagcttatcg 
atagcatcac 
ccaaactcat 
ggtacgatga 
accagcctgt 
gcacccgcgc 
ggcgtggctt 
gttttgcagc 
catatttgac 
gcattgatgg 
ctggaacgcc 
gcgggattgt 
catccgcccg 
aacttaatgt 
cttcctcccc 
ggatcaagca 
accagcggtc 
tctggatgtt 
gagcttcatg 
ggtgcctaaa 
tgtttacaaa 
actgtatttt 
gaaccaccag 
atgcgtggaa 
taatgatggc 
cgtcatagtt 
gggtgccaga 
tttgcatttc 
agaaaacggt 



300 

360 

420 

4B0 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 
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ttccggggta ggggagatca gctgggaaga 
gcagccggtg ggcccgtaaa tcacacctat 
gcagctgccg tcatccctga gcaggggggc 
gttttccctg accaaatccg ccagaaggcg 
ggaagcaaag tttttcaacg gtttgagacc 
accaagcagt tccaggcggt cccacagctc 
catatctcct cgtttcgcgg gttggggcgg 
tccagacggg ccagggtcat gtctttccac 
gtcacggtga aggggtgcgc tccgggctgc 
ctgctggtgc tgaagcgctg ccggtcttcg 
atggtgtcat agtccagccc ctccgcggcg 
gaggcgccgc acgaggggca gtgcagactt 
accgattccg gggagtaggc atccgcgccg 
agccaggtg^ gctctggccg ttcggggtca 
cgtttcttac ctctggtttc catgagccgg 
gtgtccccgt atacagactt gagaggcctg 
tatagaaact cggaccactc tgagacaaag 
aagtgggagg ggtagcggtc gttgtccact 
cacatgtcgc cctcttcggc atcaaggaag 
ccgggtgttc ctgaaggggg gctataaaag 
tccgcatcgc tgtctgcgag ggccagctgt 
atgacttctg cgctaagatt gtcagtttcc 
cccgcggtga tgcctttgag ggtggccgca 
ttgtcaagct tcgagggggg gcccggtacc 
tgcgcgcttg gcgtaatcat ggtcatagct 
aattccacac aacatacgag ccggaagcat 
gagctaactc acattaattg cgttgcgctc 
gtgccagctg cattaatgaa tcggccaacg 
ctcttccgct tcctcgctca ctgactcgct 
atcagctcac tcaaaggcgg taatacggtt 
gaacatgtga gcaaaaggcc agcaaaaggc 
gtttttccat aggctccgcc cccctgacga 
gtggcgaaac ccgacaggac tataaagata 
gcgctctcct gttccgaccc tgccgcttac 
aagcgtggcg ctttctcata gctcacgctg 
ctccaagctg ggctgtgtgc acgaaccccc 
taactatcgt cttgagtcca acccggtaag 
tggtaacagg attagcagag cgaggtatgt 
gcctaactac ggctacacta gaaggacagt 
taccttcgga aaaagagttg gtagctcttg 
tggttttttt gtttgcaagc agcagattac 
tttgatcttt tctacggggt ctgacgctca 
ggtcatgaga ttatcaaaaa ggatcttcac 
taaatcaatc taaagtatat atgagtaaac 
tgaggcacct atctcagcga tctgtctatt 
cgtgtagata actacgatac gggagggctt 
gcgagaccca cgctcaccgg ctccagattt 
cgagcgcaga agtggtcctg caactttatc 
ggaagctaga gtaagtagtt cgccagttaa 
aggcatcgtg gtgtcacgct cgtcgtttgg 
atcaaggcga gttacatgat cccccatgtt 
tccgatcgtt gtcagaagta agttggccgc 
gcataattct cttactgtca tgccatccgt 
aaccaagtca ttctgagaat agtgtatgcg 
acgggataat accgcgccac atagcagaac 
ttcggggcga aaactctcaa ggatcttacc 
tcgtgcaccc aactgatctt cagcatcttt 
aacaggaagg caaaatgccg caaaaaaggg 
catactcttc ctttttcaat attattgaag 
atacatattt gaatgtattt agaaaaataa 
aaaagtgcca cctgacgcgc cctgtagcgg 
gcgcagcgtg accgctacac ttgccagcgc 



aagcaggttc ctgagcagct gcgacttacc 4 020 
taccggctgc aactggtagt taagagagct 4 080 
cacttcgtta agcatgtccc tgactcgcat 4140 
ctcgccgccc agcgatagca gttcttgcaa 4 200 
gtccgccgta ggcatgcttt tgagcgtttg 4 260 
ggtcacctgc tctacggcat ctcgatccag 4 320 
ctttcgctgt acggcagtag tcggtgctcg 4 380 
gggcgcaggg tcctcgtcag cgtagtctgg 4 440 
gcgctggcca gggtgcgctt gaggctggtc 4 500 
ccctgcgcgt cggccaggta gcatttgacc 4 560 
tggcccttgg cgcgcagctt gcccttggag 4 620 
ttgagggcgt agagcttggg cgcgagaaat 4 680 
caggccccgc agacggtctc gcattccacg 4740 
aaaaccaggt ttcccccatg ctttttgatg 4 800 
tgtccacgct cggtgacgaa aaggctgtcc 4860 
tcctcgagcg gtgttccgcg gtcctcctcg 4 920 
gctcgcgtcc aggccagcac gaaggaggct 4 980 
agggggtcca ctcgctccag ggtgtgaaga 504 0 
gtgattggtt tgtaggtgta ggccacgtga 5100 
ggggtggggg cgcgttcgtc ctcactctct 5160 
tggggtgagt actccctctg aaaagcgggc 5220 
aaaaacgagg aggatttgat ^ttcacctgg 5280 
tccatctggt cagaaaagac aatctttttg 5340 
cagcttttgt tccctttagt gagggttaat 5400 
gtttcctgtg tgaaattgtt atccgctcac 54 60 
aaagtgtaaa gcctggggtg cctaatgagt 5520 
actgcccgct ttccagtcgg gaaacctgtc 5580 
cgcggggaga ggcggtttgc gtattgggcg 564 0 
gcgctcggtc gttcggctgc ggcgagcggt 5700 
atccacagaa tcaggggata acgcaggaaa 57 60 
caggaaccgt aaaaaggccg cgttgctggc 5820 
gcatcacaaa aatcgacgct caagtcagag 5880 
ccaggcgttt ccccctggaa gctccctcgt 5940 
cggatacctg tccgcctttc tcccttcggg 6000 
taggtatctc agttcggtgt aggtcgttcg 6060 
cgttcagccc gaccgctgcg ccttatccgg 6120 
acacgactta tcgccactgg cagcagccac 6180 
aggcggtgct acagagttct tgaagtggtg 6240 
atttggtatc tgcgctctgc tgaagccagt 6300 
atccggcaaa caaaccaccg ctggtagcgg 6360 
gcgcagaaaa aaaggatctc aagaagatcc 64 20 
gtggaacgaa aactcacgtt aagggatttt 6480 
ctagatcctt ttaaattaaa aatgaagttt 6540 
ttggtctgac agttaccaat gcttaatcag 6600 
tcgttcatcc atagttgcct gactccccgt 6660 
accatctggc cccagtgctg caatgatacc 6720 
atcagcaata aaccagccag ccggaagggc 6780 
cgcctccatc cagtctatta attgttgccg 6840 
tagtttgcgc aacgttgttg ccattgctac 6900 
tatggcttca ttcagctccg gttcccaacg 6960 
gtgcaaaaaa gcggttagct ccttcggtcc 7020 
agtgttatca ctcatggtta tggcagcact 7080 
aagatgcttt tctgtgactg gtgagtactc 7140 
gcgaccgagt tgctcttgcc cggcgtcaat 7200 
tttaaaagtg ctcatcattg gaaaacgttc 7260 
gctgttgaga tccagttcga tgtaacccac 7320 
tactttcacc agcgtttctg ggtgagcaaa 7380 
aataagggcg acacggaaat gttgaatact 7440 
catttatcag ggttattgtc tcatgagcgg 7500 
acaaataggg gttccgcgca catttccccg 7560 
cgcattaagc gcggcgggtg tggtggttac 7620 
cctagcgccc gctcctttcg ctttcttccc 7680 
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ttcctttctc gccacgttcg ccggctttcc 

agggttccga tttagtgctt tacggcacct 

ttcacgtagt gggccatcgc cctgatagac 

gttctttaat agtggactct tgttccaaac 

ttcttttgat ttataaggga ttttgcgatt 

taacaaaaat ttaacgcgaa ttttaacaaa 

tcaggctgcg caactgttgg gaagggcgat 

tggcgaaagg gggatgtgct gcaaggcgat 

cacgacgttg taaaacgacg gccagtgagc 

tggagctcca ccgcggtg 



-50- 

ccgtcaagct ctaaatcggg ggctcccttt 774 0 
cgaccccaaa aaacttgatt agggtgatgg 7800 
ggtttttcgc cctttgacgt tggagtccac 78 60 
tggaacaaca ctcaacccta tctcggtcta 7920 
tcggcctatt ggttaaaaaa tgagctgatt 7980 
atattaacgc ttacaatttc cattcgccat 8040 
cggtgcgggc ctcttcgcta ttacgccagc 8100 
taagttgggt aacgccaggg ttttcccagt 8160 
gcgcgtaata cgactcacta tagggcgaat 8220 

8238 



<210> 51 

<211> 11 

<212> DMA 

<213> adenovirus 

<400> 51 
cgcggatccc g 



<210> 52 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 52 

ctgacaaact cagatcttgt ttattg 26 



<210> 53 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 53 

gtcgactcta gaggatccag a 21 



<210> 54 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 54 

ccggactcta gatggcaacc atggcgctac 



<210> 55 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer 

<400> 55 3-,^ 
ggaggggaag cttggccctc agccagcctc t 

<210> 56 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 56 ^ 4.4. AO 

tgtcttggat ccaagatgaa gcgcgcccgc cccag^cgaag atgacttc 

<210> 5*7 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 57 28 
aaacacggcg gccgctcttt cattcttg 

<210> 58 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: primer 

<400> 58 25 
cgcgctgact cttaggacta gtttc 

<210> 59 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 59 3-j 
gcgcttaatt aacatcatca ataatatacc ttatttt 

<210> 60 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
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<400> 60 

tgaagcgcgc aagaccgtct gaag 



<210> 61 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 61 

cataacactg cagattcttt attcttgg 



<210> 62 

<211> 47 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 62 

ggtacacagg aaacaggagg ttccggaggt ggaggagaca caactcc 



<210> 63 
<211> 6 
<212> PRT 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence: synthetic peptide 
<400> 63 

Gly Gly Ser Gly Gly Gly 
1 5 



<210> 64 
<211> 7231 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid 



<400> 64 

ctgctccctg 

acaaggcaag 

ctgcttcgcg 

tagtaatcaa 

cttacggtaa 

atgacgtatg 

tatttacggt 

cctattgacg 

tgggactttc 

cggttttggc 

ctccacccca 

aaatgtcgta 



cttgtgtgtt 
gcttgaccga 
atgtacgggc 
ttacggggtc 
atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 



ggaggtcgct 
caattgcatg 
cagatatacg 
attagttcat 
tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 



gagtagtgcg 
aagaatctgc 
cgttgacatt 
agcccatata 
cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 



cgagcaaaat 
ttagggttag 
gattattgac 
tggagttccg 
cccgcccatt 
attgacgtca 
atcatatgcc 
atgcccagta 
tcgctattac 
actcacgggg 
aaaatcaacg 
gtaggcgtgt 



ttaagctaca 60 
gcgttttgcg 120 
tagttattaa 180 
cgttacataa 240 
gacgtcaata 300 
atgggtggac 360 
aagtacgccc 420 
catgacctta 480 
catggtgatg 540 
atttccaagt 600 
ggactttcca 660 
acggtgggag 720 
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gtctatataa gcagagctct ctggctaact 
attaatacga ctcactatag ggagacccaa 
ccgagctcgg atccactctc ttccgcatcg 
tactccctct gaaaagcggg catgacttct 
gaggatttga tattcacctg gcccgcggtg 
tcagaaaaga caat.cttttt gttgtcaagc 
gacagcaact tggcgatgga gcgcagggtt 
gccgcgatgt ttagctgcac gtattcgcgc 
gtgcgctcgt cgggcaccag gtgcacgcgc 
acgctggtgg ctacctctcc gcgtaggcgc 
cgcgagcaga atggcggtag ggggtctagc 
gtaaagaccc cgggcagcag gcgcgcgtcg 
agcgcctgct gccatgcgcg ggcggcaagc 
catggcatgg ggtgggtgag cgcggaggcg 
ggctctctga gtattccaag atatgtaggg 
acgtaatcgt atagttcgtg cgagggagcg 
ggctgctctg ctcggaagac tatctgcctg 
ggacgctgga agacgttgaa gctggcgtct 
gcgtaggagt cgcgcagctt gttgaccagc 
tagtccaggg tttccttgat gatgtcatac 
cggttgagga caaactcttc gcggtctttc 
tccgaacgag atccgtactc cgccgccgag 
ggaaaacctc tcgagaaagg cgtctaacca 
gcccgcccca gcgaagatga cttcaacccc 
aatatcccct tcctcactcc cccctttgtc 
ggggtactgt cactcaaact ggctgatcca 
aaggtgggag gtggtctcac tttgcaagat 
ctgcaagtta atactgataa aaaacttgag 
gctaataaac ttagtttaaa agtaggacat 
gcggggttaa aagatttaat tggcaaactt 
gaaaatttag aaaatacaga tggtagcagc 
gaagggttga catttgacaa tgatggatac 
cgcacacttt ggacaacacc agacacatct 
tctaaactca ctttggtact tacaaagtgt 
attgtggtcg caggaaagta ccacatcata 
tttactatta aactgctatt taataagaac 
aaagcttatt ggaactttag aagtggaaat 
attggtttta tgcctaattt ggtagcgtat 
agagacatag tttatggaac tatatatctt 
aaaactacct ttaaccaaga aactggatgt 
tccaaaacct atgaaaatgt tgaatttgaa 
caagaatgaa agagcggccg ctcgagtcta 
cgactgtgcc ttctagttgc cagccatctg 
ccctggaagg tgccactccc actgtccttt 
gtctgagtag gtgtcattct attctggggg 
attgggaaga caatagcagg catgctgggg 
aaagaaccag ctggggctct agggggtatc 
cggcgggtgt ggtggttacg cgcagcgtga 
ctcctttcgc tttcttccct tcctttctcg. 
taaatcgggg catcccttta gggttccgat 
aacttgatta gggtgatggt tcacgtagtg 
ctttgacgtt ggagtccacg ttctttaata 
tcaaccctat ctcggtctat tcttttgatt 
ggttaaaaaa tgagctgatt taacaaaaat 
tcagttaggg tgtggaaagt ccccaggctc 
atctcaatta gtcagcaacc aggtgtggaa 
tgcaaagcat gcatctcaat tagtcagcaa 
cgcccctaac tccgcccagt tccgcccatt 
tttatgcaga ggccgaggcc gcctctgcct 
tttttggagg cctaggcttt tgcaaaaagc 
ctgatcagca cgtgttgaca attaatcatc 
caaggtgagg aactaaacca tggccaagtt 
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agagaaccca ctgcttactg gcttatcgaa 780 
gctggctagc gtttaaactt aagcttggta 840 
ctgtctgcga gggccagctg ttggggtgag 900 
gcgctaagat tgtcagtttc caaaaacgag 960 
atgcctttga gggtggccgc atccatctgg 1020 
ttggtggcaa acgacccgta gagggcgttg 1080 
tggtttttgt cgcgatcggc gcgctccttg 1140 
gcaacgcacc gccattcggg aaagacggtg 1200 
caaccgcggt tgtgcagggt gacaaggtca 1260 
tcgttggtcc agcagaggcg gccgcccttg 1320 
tgcgtctcgt ccggggggtc tgcgtccacg 1380 
aagtagtcta tcttgcatcc ttgcaagtct 14 4 0 
gcgcgctcgt atgggttgag tgggggaccc 1500 
tacatgccgc aaatgtcgta aacgtagagg 1560 
tagcatcttc caccgcggat gctggcgcgc 1620 
aggaggtcgg gaccgaggtt gctacgggcg 1680 
aagatggcat gtgagttgga tgatatggtt 174 0 
gtgagaccta ccgcgtcacg cacgaaggag 1800 
tcggcggtga cctgcacgtc tagggcgcag 18 60 
ttatcctgtc cctttttttt ccacagctcg 1920 
cagtactctt ggatcggaaa cccgtcggcc 1980 
ggacctgagc gagtccgcat ^gaccggatc 204 0 
gtcacagtcg caagatccaa gatgaagcgc 2100 
gtctacccct atggctacgc gcggaatcag 2160 
tcctccgatg gattcaaaaa cttcccccct 2220 
atcaccatta ccaatgggga tgtatccctc 2280 
ggaagcctaa ctgtaaaccc taaggctcca 234 0 
cttgcatatg ataatccatt tgaaagtagt 2400 
ggattaaaag tattagatga aaaaagtgct 2460 
gtggttttaa caggaaaagg aataggcact 2520 
agaggaattg gtataaatgt aagagcaaga 2580 
ttggtagcat ggaacccaaa gtatgacacg 2640 
ccaaactgca caattgctca agataaggac 2700 
ggaagtcaaa tattagctaa tgtgtctttg 2760 
aataataaga caaatccaaa aataaaaagt 2820 
ggagtgcttt tagacaactc aaatcttgga 2880 
tccaatgttt cgacagctta tgaaaaagca 2940 
ccaaaaccca gtaattctaa aaaatatgca 3000 
ggtggaaaac ctgatcagcc agcagtcatt 3060 
gaatactcta tcacatttaa ctttagttgg 3120 
accacctctt ttaccttctc ctatattgcc 3180 
gagggcccgt ttaaacccgc tgatcagcct 3240 
ttgtttgccc ctcccccgtg ccttccttga 3300 
cctaataaaa tgaggaaatt gcatcgcatt 3360 
gtggggtggg gcaggacagc aagggggagg 34 20 
atgcggtggg ctctatggct tctgaggcgg 3480 
cccacgcgcc ctgtagcggc gcattaagcg 3540 
ccgctacact tgccagcgcc ctagcgcccg 3600 
ccacgttcgc cggctttccc cgtcaagctc 3660 
ttagtgcttt acggcacctc gaccccaaaa 3720 
ggccatcgcc ctgatagacg gtttttcgcc 3780 
gtggactctt gttccaaact ggaacaacac 384 0 
tataagggat tttggggatt tcggcctatt 3900 
ttaacgcgaa ttaattctgt ggaatgtgtg 3960 
cccaggcagg cagaagtatg caaagcatgc 4020 
agtccccagg ctccccagca ggcagaagta 4080 
ccatagtccc gcccctaact ccgcccatcc 4140 
ctccgcccca tggctgacta atttttttta 4200 
ctgagctatt ccagaagtag tgaggaggct 4260 
tcccgggagc ttgtatatcc attttcggat 4320 
ggcatagtat atcggcatag tataatacga 4380 
gaccagtgcc gttccggtgc tcaccgcgcg 44 40 
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cgacgtcgcc 
ggaggacgac 
ggaccaggtg 
gtacgccgag 
gaccgagatc 
ctgcgtgcac 
cgccgccttc 
cctccagcgc 
ttataatggt 
actgcattct 
gtcgacctct 
ttatccgctc 
tgcctaatga 
gggaaacctg 
gcgtattggg 
gcggcgagcg 
taacgcagga 
cgcgttgctg 
ctcaagtcag 
aagctccctc 
tctcccttcg 
gtaggtcgtt 
cgccttatcc 
ggcagcagcc 
cttgaagtgg 
gctgaagcca 
cgctggtagc 
tcaagaagat 
ttaagggatt 
aaaatgaagt 
atgcttaatc 
ctgactcccc 
tgcaatgata 
agccggaagg 
taattgttgc 
tgccattgct 
cggttcccaa 
ctccttcggt 
tatggcagca 
tggtgagtac 
cccggcgtca 
tggaaaacgt 
gatgtaaccc 
tgggtgagca 
atgttgaata 
tctcatgagc 
cacatttccc 



ggagcggtcg 
ttcgccggtg 
gtgccggaca 
tggtcggagg 
ggcgagcagc 
ttcgtggccg 
tatgaaaggt 
ggggatctca 
tacaaat:aaa 
agttgtggtt 
agctagagct 
acaattccac 
gtgagctaac 
tcgtgccagc 
cgctcttccg 
gtatcagctc 
aagaacatgt 
gcgtttttcc 
aggtggcgaa 
gtgcgctctc 
ggaagcgtgg 
cgctccaagc 
ggtaactatc 
actggtaaca 
tggcctaact 
gttaccttcg 
ggtggttttt 
cctttgatct 
ttggtcatga 
tttaaatcaa 
agtgaggcac 
gtcgtgtaga 
ccgcgagacc 
gccgagcgca 
cgggaagcta 
acaggcatcg 
cgatcaaggc 
cctccgatcg 
ctgcataatt 
tcaaccaagt 
atacgggata 
tcttcggggc 
actcgtgcac 
aaaacaggaa 
ctcatactct 
ggatacatat 
cgaaaagtgc 



agttctggac 
tggtccggga 
acaccctggc 
tcgtgtccac 
cgtgggggcg 
aggagcagga 
tgggcttcgg 
tgctggagtt 
gcaatagcat 
tgtccaaact 
tggcgtaatc 
acaacatacg 
tcacattaat 
tgcattaatg 
cttcctcgct 
actcaaaggc 
gagcaaaagg 
ataggctccg 
acccgacagg 
ctgttccgac 
cgctttctca 
tgggctgtgt 
gtcttgagtc 
ggattagcag 
acggctacac 
gaaaaagagt 
ttgtttgcaa 
tttctacggg 
gattatcaaa 
tctaaagtat 
ctatctcagc 
taactacgat 
cacgctcacc 
gaagtggtcc 
gagtaagtag 
tggtgtcacg 
gagttacatg 
ttgtcagaag 
ctcttactgt 
cattctgaga 
ataccgcgcc 
gaaaactctc 
ccaactgatc 
ggcaaaatgc 
tcctttttca 
ttgaatgtat 
cacctgacgt 



cgaccggctc 
cgacgtgacc 
ctgggtgtgg 
gaacttccgg 
ggagttcgcc 
ctgacacgtg 
aatcgttttc 
cttcgcccac 
cacaaatttc 
catcaatgta 
atggtcatag 
agccggaagc 
tgcgttgcgc 
aatcggccaa 
cactgactcg 
ggtaatacgg 
ccagcaaaag 
cccccctgac 
actataaaga 
cctgccgctt 
atgctcacgc 
gcacgaaccc 
caacccggta 
agcgaggtat 
tagaaggaca 
tggtagctct 
gcagcagatt 
gtctgacgct 
aaggatcttc 
atatgagtaa 
gatctgtcta 
acgggagggc 
ggctccagat 
tgcaacttta 
ttcgccagtt 
ctcgtcgttt 
atcccccatg 
taagttggcc 
catgccatcc 
atagtgtatg 
acatagcaga 
aaggatctta 
ttcagcatct 
cgcaaaaaag 
atattattga 
ttagaaaaat 
c 



gggttctccc 
ctgttcatca 
gtgcgcggcc 
gacgcctccg 
ctgcgcgacc 
ctacgagatt 
cgggacgccg 
cccaacttgt 
acaaataaag 
tcttatcatg 
ctgtttcctg 
ataaagtgta 
tcactgcccg 
cgcgcgggga 
ctgcgctcgg 
ttatccacag 
gccaggaacc 
gagcatcaca 
taccaggcgt 
accggatacc 
tgtaggtatc 
cccgttcagc 
agacacgact 
gtaggcggtg 
gtatttggta 
tgatccggca 
acgcgcagaa 
cagtggaacg 
acctagatcc 
acttggtctg 
tttcgttcat 
ttaccatctg 
ttatcagcaa 
tccgcctcca 
aatagtttgc 
ggtatggctt 
ttgtgcaaaa 
gcagtgttat 
gtaagatgct 
cggcgaccga 
actttaaaag 
ccgctgttga 
tttactttca 
ggaataaggg 
agcatttatc 
aaacaaatag 



gggacttcgt 
gcgcggtcca 
tggacgagct 
ggccggccat 
cggccggcaa 
tcgattccac 
gctggatgat 
ttattgcagc 
catttttttc 
tctgtatacc 
tgtgaaattg 
aagcctgggg 
ctttccagtc 
gaggcggttt 
tcgttcggct 
aatcagggga 
gtaaaaaggc 
aaaatcgacg 
ttccccctgg 
tgtccgcctt 
tcagttcggt 
^cgaccgctg 
tatcgccact 
ctacagagtt 
tctgcgctct 
aacaaaccac 
aaaaaggatc 
aaaactcacg 
ttttaaatta 
acagttacca 
ccatagttgc 
gccccagtgc 
taaaccagcc 
tccagtctat 
gcaacgttgt 
cattcagctc 
aagcggttag 
cactcatggt 
tttctgtgac 
gttgctcttg 
tgctcatcat 
gatccagttc 
ccagcgtttc 
cgacacggaa 
agggttattg 
gggttccgcg 



4500 
4560 
4620 
4680 
4740 
4600 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7231 



<210> 65 
<211> 8484 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid 

<400> 65 ^ 

ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg cgagcaaaat ttaagctaca 60 

acaaggcaag gcttgaccga caattgcatg aagaatctgc ttagggttag gcgttttgcg 120 

ctgcttcgcg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 180 

tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa ZQU 
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cttacggtaa 
atgacgtatg 
tatttacggt 
cctattgacg 
tgggactttc 
cggttttggc 
ctccacccca 
aaatgtcgta 
gtctatataa 
attaatacga 
ccgagctcgg 
tactccctct 
gaggatttga 
tcagaaaaga 
gacagcaact 
gccgcgatgt 
gtgcgctcgt 
acgctggtgg 
cgcgagcaga 
gtaaagaccc 
agcgcctgct 
catggcatgg 
ggctctctga 
acgtaatcgt 
ggctgctctg 
ggacgctgga 
gcgtaggagt 
tagtccaggg 
cggttgagga 
tccgaacgag 
ggaaaacctc 
gcaagaccgt 
ccaactgtgc 
cctggggtac 
ctcaaaatgg 
accactgtga 
cccctcacag 
ggcaacacac 
attgccaccc 
cccct caeca 
gccactggta 
ggactaaagt 
actggtccag 
ggttttgatt 
caaaacagac 
ctaagactag 
aacaaaggcc 
agcactgcca 
cttgaatttg 
ggcctagaat 
gacagcacag 
acaccagctc 
gtcttaacaa 
agtttggctc 
aatggagtgc 
gatcttactg 
tatccaaaat 
ggagacaaaa 
gacacaactc 
attaatgaaa 
aagcggccgc 
actggtattc 



atggcccgcc 
ttcccatagt 
aaactgccca 
tcaatgacgg 
ctacttggca 
agtacatcaa 
ttgacgtcaa 
acaactccgc 
gcagagctct 
ctcactatag 
atccactctc 
gaaaagcggg 
tattcacctg 
caatcttttt 
tggcgatgga 
ttagctgcac 
cgggcaccag 
ctacctctcc 
atggcggtag 
cgggcagcag 
gccatgcgcg 
ggtgggtgag 
gtattccaag 
atagttcgtg 
ctcggaagac 
agacgttgaa 
cgcgcagctt 
tttccttgat 
caaactcttc 
atccgtactc 
tcgagaaagg 
ctgaagatac 
cttttcttac 
tctctttgcg 
gcaacggcct 
gcccacctct 
ttacctcaga 
tcaccatgca 
aaggacccct 
ccaccgatag 
gcttgggcat 
acggggctcc 
gtgtgactat 
cacaaggcaa 
gccttatact 
gacagggccc 
tttacttgtt 
aggggttgat 
gttcacctaa 
ttgattcaaa 
gtgccattac 
catctcctaa 
aatgtggcag 
caatatctgg 
tactaaacaa 
aaggcacagc 
ctcacggtaa 
ctaaacctgt 
caagtgcata 
tatttgccac 
tcgagtctag 
ttaactatgt 



tggctgaccg 
aacgccaata 
cttggcagta 
taaatggccc 
gtacatctac 
tgggcgtgga 
tgggagtttg 
cccattgacg 
ctggctaact 
ggagacccaa 
ttccgcatcg 
catgacttct 
gcccgcggtg 
gttgtcaagc 
gcgcagggtt 
gtattcgcgc 
gtgcacgcgc 
gcgtaggcgc 
ggggtctagc 
gcgcgcgtcg 
ggcggcaagc 
cgcggaggcg 
atatgtaggg 
cgagggagcg 
tatctgcctg 
gctggcgtct 
gttgaccagc 
gatgtcatac 
gcggtctttc 
cgccgccgag 
cgtctaacca 
cttcaacccc 
tcctcccttt 
cctatccgaa 
ctctctggac 
caaaaaaacc 
agccctaact 
atcacaggcc 
cacagtgtca 
cagtaccctt 
tgacttgaaa 
tttgcatgta 
taataatact 
tatgcaactt 
tgatgttagt 
tctttttata 
tacagcttca 
gtttgacgct 
tgcaccaaac 
caaggctatg 
agtaggaaac 
ctgtagacta 
tcaaatactt 
aacagttcaa 
ttccttcctg 
ctatacaaac 
aactgccaaa 
aacactaacc 
ctctatgtca 
atcctcttac 
cgataatcaa 
tgctcctttt 



cccaacgacc 
gggactttcc 
catcaagtgt 
gcctggcatt 
gtattagtca 
tagcggtttg 
ttttggcacc 
caaatgggcg 
agagaaccca 
gctggctagc 
ctgtctgcga 
gcgctaagat 
atgcctttga 
ttggtggcaa 
tggtttttgt 
gcaacgcacc 
caaccgcggt 
tcgttggtcc 
tgcgtctcgt 
aagtagtcta 
gcgcgctcgt 
tacatgccgc 
tagcatcttc 
aggaggtcgg 
aagatggcat 
gtgagaccta 
tcggcggtga 
ttatcctgtc 
cagtactctt 
ggacctgagc 
gtcacagtcg 
gtgtatccat 
gtatccccca 
cctctagtta 
gaggccggca 
aagtcaaaca 
gtggctgccg 
ccgctaaccg 
gaaggaaagc 
actatcactg 
gagcccattt 
acagacgacc 
tccttgcaaa 
aatgtagcag 
tatccgtttg 
aactcagccc 
aacaattcca 
acagccatag 
acaaatcccc 
gttcctaaac 
aaaaataatg 
aatgcagaga 
gctacagttt 
agtgctcatc 
gacccagaat 
gctgttggat 
agtaacattg 
attacactaa 
ttttcatggg 
actttttcat 
cctctggatt 
acgctatgtg 



cccgcccatt 
attgacgtca 
atcatatgcc 
atgcccagta 
tcgctattac 
actcacgggg 
aaaatcaacg 
gtaggcgtgt 
ctgcttactg 
gtttaaactt 
gggccagctg 
tgtcagtttc 
gggtggccgc 
acgacccgta 
cgcgatcggc 
gccattcggg 
tgtgcagggt 
agcagaggcg 
ccggggggtc 
tcttgcatcc 
atgggttgag 
aaatgtcgta 
caccgcggat 
gaccgaggtt 
gtgagttgga 
ccgcgtcacg 
cctgcacgtc 
cctttttttt 
ggatcggaaa 
gagtccgcat 
caagatccaa 
atgacacgga 
atgggtttca 
cctccaatgg 
accttacctc 
taaacctgga 
ccgcacctct 
tgcacgactc 
tagccctgca 
cctcaccccc 
atacacaaaa 
taaacacttt 
ctaaagttac 
gaggactaag 
atgctcaaaa 
acaacttgga 
aaaagcttga 
ccattaatgc 
tcaaaacaaa 
taggaactgg 
ataagctaac 
aagatgctaa 
cagttttggc 
ttattataag 
attggaactt 
ttatgcctaa 
tcagtcaagt 
acggtacaca 
actggtctgg 
acattgccca 
acaaaatttg 
gatacgctgc 



gacgtcaata 
atgggtggac 
aagtacgccc 
catgacctta 
catggtgatg 
atttccaagt 
ggactttcca 
acggtgggag 
gcttatcgaa 
aagcttggta 
ttggggtgag 
caaaaacgag 
atccatctgg 
gagggcgttg 
gcgctccttg 
aaagacggtg 
gacaaggtca 
gccgcccttg 
tgcgtccacg 
ttgcaagtct 
tgggggaccc 
^acgtagagg 
gctggcgcgc 
gctacgggcg 
tgatatggtt 
cacgaaggag 
tagggcgcag 
ccacagctcg 
cccgtcggcc 
cgaccggatc 
gatgaagcgc 
aaccggtcct 
agagagtccc 
catgcttgcg 
ccaaaatgta 
aatatctgca 
aatggtcgcg 
caaacttagc 
aacatcaggc 
tctaactact 
tggaaaacta 
gaccgtagca 
tggagccttg 
gattgattct 
ccaactaaat 
tattaactac 
ggttaaccta 
aggagatggg 
aattggccat 
ccttagtttt 
tttgtggacc 
actcactttg 
tgttaaaggc 
atttgacgaa 
tagaaatgga 
cctatcagct 
ttacttaaac 
ggaaacagga 
ccacaactac 
agaataaaag 
tgaaagattg 
tttaatgcct 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660" 

3720 

3780 

3840 

3900 

3960 
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ttgtatcatg ctattgcttc ccgtatggct 
ttgctgtctc tttatgagga gttgtggccc 
gtgtttgctg acgcaacccc cactggttgg 
gggactttcg ctttccccct ccctattgcc 
cgctgctgga caggggctcg gctgttgggc 
ctgacgtcct ttccatggct gctcgcctgt 
ttctgctacg tcccttcggc cctcaatcca 
gctctgcggc ctcttccgcg tcttcgcctt 
gccgcctccc cgcctgatcg ctagagggcc 
gccttctagt tgccagccat ctgttgtttg 
aggtgccact cccactgtcc tttcctaata 
taggtgtcat tctattctgg ggggtggggt 
agacaatagc aggcatgctg gggatgcggt 
cagctggggc tctagggggt atccccacgc 
tgtggtggtt acgcgcagcg tgaccgctac 
cgctttcttc ccttcctttc tcgccacgtt 
gggcatccct ttagggttcc gatttagtgc 
ttagggtgat ggttcacgta gtgggccatc 
gttggagtcc acgttcttta atagtggact 
tatctcggtc tattcttttg atttataagg 
aaatgagctg atttaacaaa aatttaacgc 
gggtgtggaa agtccccagg ctccccaggc 
ttagtcagca accaggtgtg gaaagtcccc 
catgcatctc aattagtcag caaccatagt 
aactccgccc agttccgccc attctccgcc 
agaggccgag gccgcctctg cctctgagct 
aggcctaggc ttttgcaaaa agctcccggg 
gcacgtgttg acaattaato atcggcatag 
aggaactaaa ccatggccaa gttgaccagt 
gccggagcgg tcgagttctg gaccgaccgg 
gacttcgccg gtgtggtccg ggacgacgtg 
gtggtgccgg acaacaccct ggcctgggtg 
gagtggtcgg aggtcgtgtc cacgaacttc 
atcggcgagc agccgtgggg gcgggagttc 
cacttcgtgg ccgaggagca ggactgacac 
ttctatgaaa ggttgggctt cggaatcgtt 
cgcggggatc tcatgctgga gttcttcgcc 
ggttacaaat aaagcaatag catcacaaat 
tctagttgtg gtttgtccaa actcatcaat 
tctagctaga gcttggcgta atcatggtca 
ctcacaattc cacacaacat acgagccgga 
tgagtgagct aactcacatt aattgcgttg 
ctgtcgtgcc agctgcatta atgaatcggq 
gggcgctctt ccgcttcctc gctcactgac 
gcggtatcag ctcactcaaa ggcggtaata 
ggaaagaaca tgtgagcaaa aggccagcaa 
ctggcgtttt tccataggct ccgcccccct 
cagaggtggc gaaacccgac aggactataa 
ctcgtgcgct ctcctgttcc gaccctgccg 
tcgggaagcg tggcgctttc tcaatgctca 
gttcgctcca agctgggctg tgtgcacgaa 
tccggtaact atcgtcttga gtccaacccg 
gccactggta acaggattag cagagcgagg 
tggtggccta actacggcta cactagaagg 
ccagttacct tcggaaaaag agttggtagc 
agcggtggtt tttttgtttg caagcageag 
gatcctttga tcttttctac ggggtctgac 
attttggtca tgagattatc aaaaaggatc 
agttttaaat caatctaaag tatatatgag 
atcagtgagg cacctatctc agcgatctgt 
cccgtcgtgt agataactac gatacgggag 
ataccgcgag acccacgctc accggctcca 
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ttcattttct cctccttgta taaatcctgg 4020 
gttgtcaggc aacgtggcgt ggtgtgcact 4 080 
ggcattgcca ccacctgtca gctcctttcc 4140 
acggcggaac tcatcgccgc ctgccttgcc 4 200 
actgacaatt ccgtggtgtt gtcggggaag 4260 
gttgccacct ggattctgcg cgggacgtcc 4320 
gcggaccttc cttcccgcgg cctgctgccg 4 380 
cgccctcaga cgagtcggat ctccctttgg 4440 
cgtttaaacc cgctgatcag cctcgactgt 4500 
cccctccccc gtgccttcct tgaccctgga 4560 
aaatgaggaa attgcatcgc attgtctgag 4 620 
ggggcaggac agcaaggggg aggattggga 4 680 
gggctctatg gcttctgagg cggaaagaac 47 4 0 
gccctgtagc ggcgcattaa gcgcggcggg 4800 
acttgccagc gccctagcgc ccgctccttt 4860 
cgccggcttt ccccgtcaag ctctaaatcg 4 920 
tttacggcac ctcgacccca aaaaacttga 4980 
gccctgatag acggtttttc gccctttgac 5040 
cttgttccaa actggaacaa cactcaaccc 5100 
gattttgggg atttcggcct attggttaaa 5160 
gaattaattc tgtggaatgt gtgtcagtta 5220 
aggcagaagt atgcaaagca ^gcatctcaa. 5280 
aggctcccca gcaggcagaa gtatgcaaag 5340 
cccgccccta actccgccca tcccgcccct 54 00 
ccatggctga ctaatttttt ttatttatgc 5460 
attccagaag tagtgaggag gcttttttgg 5520 
agcttgtata tccattttcg gatctgatca 5580 
tatatcggca tagtataata cgacaaggtg 5640 
gccgttccgg tgctcaccgc gcgcgacgtc 5700 
ctcgggttct cccgggactt cgtggaggac 57 60 
accctgttca tcagcgcggt ccaggaccag 5820 
tgggtgcgcg gcctggacga gctgtacgcc 5880 
cgggacgcct ccgggccggc catgaccgag 5940 
gccctgcgcg acccggccgg caactgcgtg 6000 
gtgctacgag atttcgattc caccgccgcc 6060 
ttccgggacg ccggctggat gatcctccag 6120 
caccccaact tgtttattgc agcttataat 6180 
ttcacaaata aagcattttt ttcactgcat 6240" 
gtatcttatc atgtctgtat accgtcgacc 6300 
tagctgtttc ctgtgtgaaa ttgttatccg 6360 
agcataaagt gtaaagcctg gggtgcctaa 6420 
cgctcactgc ccgctttcca gtcgggaaac 64 80 
caacgcgcgg ggagaggcgg tttgcgtatt 6540 
tcgctgcgct cggtcgttcg gctgcggcga 6600 
cggttatcca cagaatcagg ggataacgca 6660 
aaggccagga accgtaaaaa ggccgcgttg 6720 
gacgagcatc acaaaaatcg acgctcaagt 6780 
agataccagg cgtttccccc tggaagctcc 6840 
cttaccggat acctgtccgc ctttctccct 6900 
cgctgtaggt atctcagttc ggtgtaggtc 6960 
ccccccgttc agcccgaccg ctgcgcctta 7020 
gtaagacacg acttatcgcc actggcagca 7080 
tatgtaggcg gtgctacaga gttcttgaag 7140 
acagtatttg gtatctgcgc tctgctgaag 7200 
tcttgatccg gcaaacaaac caccgctggt 7260 
attacgcgca gaaaaaaagg atctcaagaa 7320 
gctcagtgga acgaaaactc acgttaaggg 7380 
ttcacctaga tccttttaaa ttaaaaatga 7440 
taaacttggt ctgacagtta ccaatgctta 7500 
ctatttcgtt catccatagt tgcctgactc 7560 
ggcttaccat ctggccccag tgctgcaatg 7 620 
gatttatcag caataaacca gccagccgga 7680 
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agggccgagc 
tgccgggaag 
gctacaggca 
caacgatcaa 
ggtcctccga 
gcactgcata 
tactcaacca 
tcaatacggg 
cgttcttcgg 
cccactcgtg 
gcaaaaacag 
atactcatac 
agcggataca 
ccccgaaaag 



gcagaagtgg 
ctagagtaag 
tcgtggtgtc 
ggcgagttac 
tcgttgtcag 
attctcttac 
agtcattctg 
ataataccgc 
ggcgaaaact 
cacccaactg 
gaaggcaaaa 
tcttcctttt 
tatttgaatg 
tgccacctga 



tcctgcaact 
tagttcgcca 
acgctcgtcg 
atgatccccc 
aagtaagttg 
tgtcatgcca 
agaatagtgt 
gccacatagc 
ctcaaggatc 
atcttcagca 
tgccgcaaaa 
tcaatattat 
tatttagaaa 
cgtc 



ttatccgcct 
gttaatagtt 
tttggtatgg 
atgttgtgca 
gccgcagtgt 
tccgtaagat 
atgcggcgac 
agaacttitaa 
ttaccgctgt 
tcttttactt 
aagggaataa 
tgaagcattt 
aataaacaaa 



ccatccagtc 
tgcgcaacgt 
cttcattcag 
aaaaagcggt 
tatcactcat 
gcttttctgt 
cgagttgctc 
aagtgctcat 
tgagatccag 
tcaccagcgt 
gggcgacacg 
atcagggtta 
taggggttcc 



tattaattgt 
tgttgccatt 
ctccggttcc 
tagctccttc 
ggttatggca 
gactggtgag 
ttgcccggcg 
cattggaaaa 
ttcgatgtaa 
ttctgggtga 
gaaatgttga 
ttgtctcatg 
gcgcacattt 



7740 
7ff00 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8484 



<210> 66 
<211> 53 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 66 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 



53 



<210> 67 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 67 

cgagatcgat cacctccggt acaaggtttg gcatag 3 6 



<210> 68 
<211> 37 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 68 

catgaagatc tggaaggtgc tgaggtacga tgagacc 



<210> 69 
<211> 51 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 69 

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 
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<210> 70 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 70 

cacgaattcg tcagcgcttc tcgtcgcgtc caagaccc 38 



<210> 71 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 71 

caccccgggg aggcggcggc gacggggacg gg 32 



<210> 72 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 72 

atgggatcca agatgaagcg cgcaagaccg 30 

<210> 73 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 73 

cataacctgc aggattcttt attcttgggc 30 



<210> 74 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; oligonucleotide 
<400> 74 

ggtacacagg aaacaggagg ttccggaggt ggaggagaca caactcc 4 7 



<210> 75 
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<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 75 

atgggatcca agatgaagcg cgcaagaccg 



<210> 76 
<211> 30 
<212> DNA ,, 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 76 

cactatagcg gccgcattct cagtcatctt 



30 
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